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COUPLED POLYMERASE CHAIN REACTION-RESTRICTION 
ENDONUCLEASE DIGESTION-LIGASE DETECTION REACTION 

PROCESS 

This application claims the benefit of U.S. Provisional Patent 
Application Serial No. 60/125,251 filed March 19, 1999. 

This invention was developed with government funding under 
National Institutes of Health Grant Nos. GM-41337-06, GM-43 552-05, GM- 
42722-07, and GM-5 1628-02. The U.S. Government may have certain rights. 

FIELD OF THE INVENTION 

The present invention is directed to a process involving coupled 
polymerase chain reaction ("PCR"), restriction endonuclease digestion ("RE"), 
and ligase detection reaction ("LDR"). 

BACKGROUND OF THE INVENTION 

Cancer Detection 

As the second leading cause of death in this country, almost 
600,000 people will die from cancer per year making cancer one of the most 
alarming of all medical diagnosis. Lifetime risks for developing invasive cancers 
in men and women are 50 percent and 33 percent, respectively. Expectations are 
that more than 1 .2 million new cases of cancer will be diagnosed in the United 
States in 1995. Healthcare expenses for cancer in 1994 were approximately $104 
billion. However, the full impact of cancer on families and society is not 
measured only by the amount of money spent on its diagnosis and treatment. A 
significant number of people are stricken with cancer in their most productive 
years. Cancers accounted for 18 percent of premature deaths in 1985 and in 1991 
more than 9,200 women in the U.S. died from breast cancer before the age of 55. 

Currently, diagnosis of cancer is based on histological evaluation of 
tumor tissue by a pathologist. After a cancer is diagnosed, treatment is determined 
primarily by the extent or stage of the tumor. Tumor stage is defined by clinical, 
radiological, and laboratory methods. Standardized classification systems for the 
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staging of tumors have been developed to clearly convey clinical information 
about cancer patients. Staging provides important prognostic information and 
forms the basis of clinical studies which allow the testing of new treatment 
strategies. A staging system was developed (TNM staging system), which 
classifies tumors according to the size of the primary tumor, the number of 
regional lymph nodes in which cancer is found, and the presence or absence of 
metastases to other parts of the body. Smaller cancers with no affected lymph 
nodes and no distant metastases are considered early stage cancers, which are 
often amenable to cure through surgical resection. A common measure of 
prognosis is the 5-year survival rate, the proportion of patients alive five years 
after the diagnosis of a cancer at a given stage. While 5-year survival rates for 
many cancers have improved over the last few decades, the fact that some early 
stage cancers recur within five years or later has led researchers to explore other 
additional prognostic markers including histological grade, cytometry results, 
hormone receptor status, and many other tumor markers. Most recently, 
investigators have explored the use of molecular alterations in cancers as 
prognostic indicators. 

Genetic alterations found in cancers, such as point mutations and 
small deletions can act as markers of malignant cells. 

Detection of Minority Nucleic Acid Sequences 

A number of procedures have been disclosed to detect cancer using 
PCR. Sidransky, et al., "Identification of ras Oncogene Mutations in the Stool of 
Patients with Curable Colorectal Tumors," Science 256: 102-05 (1992) detects 
colon cancer by identification of K-ras mutations. This involves a PCR 
amplification of total DNA, cloning into a phage vector, plating out the phage, 
repeated probing with individual oligonucleotides specific to several different K- 
ras mutations, and counting the percentage of positive plaques on a given plate. 
This is a technically difficult procedure which takes three days to complete, 
whereby the ratio of mutant to wild-type DNA in the stool sample is determined. 
Brennan, et al., "Molecular Assessment of Histopathological Staging in 
Squamous-Cell Carcinoma of the Head and Neck," N. Engl. J. Med. 332(7): 429- 
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35 (1995), finds p53 mutations by sequencing. This specific mutation is then 
probed for in margin tissue using PCR amplification of total DNA, cloning into a 
phage vector, plating out the phage, probing with an individual oligonucleotide 
specific to the mutation found by sequencing, and counting the percentage of 
positive plaques on a given plate. Berthelemy, et al., "Brief Communications- 
Identification of K-ras Mutations in Pancreatic Juice in the Early Diagnosis of 
Pancreatic Cancer," Ann. Int. Med. 123(3): 188-91 (1995) uses a PCR/restriction 
enzyme process to detect K-ras mutations in pancreatic secretions. This technique 
is deficient, however, in that mutations are not quantified. Similarly, Tada, et al., 
"Detection of ras Gene Mutations in Pancreatic Juice and Peripheral Blood of 
Patients with Pancreatic Adenocarcinoma," Cancer Res. 53: 2472-74 (1993) and 
Tada, et al., "Clinical Application of ras Gene Mutation for Diagnosis of 
Pancreatic Adenocarcinoma," Gastroent. 100: 233-38 (1991) subject such samples 
to allele-specific PCR to detect pancreatic cancer. This has the disadvantages of 
providing false positives due to polymerase extension off normal template, 
requiring electrophoretic separation of products to distinguish from primer dimers, 
being unable to multiplex closely-clustered sites due to interference of overlapping 
primers, being unable to detect single base or small insertions and deletions in 
small repeat sequences, and not being practically suitable for quantification of 
mutant DNA in a high background of normal DNA. Hayashi, et al., "Genetic 
Detection Identifies Occult Lymph Node Metastases Undetectable by the 
Histopathological Method," Cancer Res. 54: 3853-56 (1994) uses an allele- 
specific PCR technique to find K-ras or p53 mutations to identify occult lymph 
node metastases in colon cancers. A sensitivity of one tumor cell in one thousand 
of normal cells is claimed; however, obtaining quantitative values requires 
laborious cloning, plating, and probing procedures. In Mitsudomi, et al., 
"Mutations of ras Genes Distinguish a Subset of Non-small-cell Lung Cancer Cell 
Lines from Small-cell Lung Cancer Cell Lines," Oncogene 6: 1353-62 (1991), 
human lung cancer cell lines are screened for point mutations of the K-, H-, and 
N-ras genes using restriction fragment length polymorphisms created through 
mismatched primers during PCR amplification of genomic DNA. The 
disadvantages of such primer-mediated RFLP include the requirement of 
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electrophoretic separation to distinguish mutant from normal DNA, limited 
applicability to sites that may be converted into a restriction site, the requirement 
for additional analysis to determine the nature of the mutation, and the difficulty in 
quantifying mutant DNA in a high background of normal DNA. Further, these 
procedures tend to be laborious and inaccurate. 

Coupled PCR/ligation processes have been used for detection of 
minority nucleotide sequences in the presence of majority nucleotide sequences. 
A PCR/LDR process is used in Frenkel, "Specific, Sensitive, and Rapid Assay for 
Human Immunodeficiency Virus Type 1 pol Mutations Associated with 
Resistance to Zidovudine and Didanosine," J. Clin. Microbiol. 33(2): 342-47 
(1995) to detect HIV mutants. This assay, however, cannot be used for multiplex 
detection. See also Abravaya, et al., "Detection of Point Mutations With a 
Modified Ligase Chain (Gap-LCR)," NucL Acids Res. 23(4): 675-82 (1995) and 
Balles, et al., "Facilitated Isolation of Rare Recombinants by Ligase Chain 
Reaction: Selection for Intragenic Crossover Events in the Drosophila optomotor- 
blind Gene," Molec. Gen. Genet. 245: 734-40 (1994). 

Colorectal lesions have been detected by a process involving PCR 
amplification followed by an oligonucleotide ligation assay. See Jen, et al., 
"Molecular Determinants of Dysplasia in Colorectal Lesions," Cancer Res. 54: 
5523-26 (1994) and Redston, et al., "Common Occurrence oiAPC and K-ras 
Gene Mutations in the Spectrum of Colitis- Associated Neoplasias," Gastroenter. 
108: 383-92 (1995). This process was developed as an advance over Powell, et 
al., "Molecular Diagnosis of Familial Adenomatous Polyposis," N. Engl. J. Med. 
329(27): 1982-87 (1993). These techniques tend to be limited and difficult to 
carry out. 

Other procedures have been developed to detect minority 
nucleotide sequences. Lu, et al., "Quantitative Aspects of the Mutant Analysis by 
PCR and Restriction Enzyme Cleavage (MAPREC)" PCR Methods and Appl. 3: 
176-80 (1993) detects virus revertants by PCR and restriction enzyme cleavage. 
The disadvantages of MAPREC include the requirement for electrophoretic 
separation to distinguish mutant from normal DNA, limited applicability to sites 
that may be converted into a restriction site, the requirement for additional 
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analysis to determine the nature of the mutation, and difficulty in quantifying 
mutant DNA in a high background of normal DNA. In Kuppuswamy, et al., 
"Single Nucleotide Primer Extension to Detect Genetic Diseases: Experimental 
Application to Hemophilia G (Factor DC) and Cystic Fibrosis Genes," Proc. Natl. 
Acad. Sci. USA 88: 1 143-47 (1991), a PCR process is carried out using two 
reaction mixtures for each fragment to be amplified with one mixture containing a 
primer and a labeled nucleotide corresponding to the normal coding sequence, 
while the other mixture contains a primer and a labeled nucleotide corresponding 
to the mutant sequence. The disadvantages of such mini sequencing (i.e. SNuPe) 
are that the mutations must be known, it is not possible to multiplex closely 
clustered sites due to interference of overlapping primers, it is not possible to 
detect single base or small insertions and deletions in small repeat sequences, and 
four separate reactions are required. A mutagenically separated PCR process is 
disclosed in Rust, et al., "Mutagenically Separated PCR (MS-PCR): a Highly 
Specific One Step Procedure for easy Mutation Detection" Nucl. Acids Res. 
21(16): 3623-29 (1993) to distinguish normal and mutant alleles, using different 
length allele-specific primers. The disadvantages of MS-PCR include possibly 
providing false positives due to polymerase extension off normal template, 
requiring electrophoretic separation of products to distinguish from primer dimers, 
the inability to multiplex closely-clustered sites due to interference of overlapping 
primers, the inability to detect single base or small insertions and deletions in 
small repeat sequences, and not being ideally suited for quantification of mutant 
DNA in high background of normal DNA. In Suzuki, et al., "Detection of ras 
Gene Mutations in Human Lung Cancers by Single-Strand Conformation 
Polymorphism Analysis of Polymerase Chain Reaction Products," Oncogene 5: 
1037-43 (1990), mutations are detected in a process having a PCR phase followed 
by a phase involving single strand conformation polymorphism ("SSCP") of the 
amplified DNA fragments. The disadvantages of SSCP include the requirement 
for electrophoretic separation to distinguish mutant conformer from normal 
conformer, the failure to detect 30% of possible mutations, the requirement for 
additional analysis to determine the nature of the mutation, and the inability to 
distinguish mutant from silent polymorphisms. 
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Nucleotide Conversion Fidelity 

Many of the approaches to detecting the presence of a given sequence or 
sequences in a polynucleotide sample involve amplification of the minority 
sequence(s) by polymerase chain reaction (PCR). U.S. Patent No. 4,683,202 to 
Mullis, et al., and R.K. Saiki, et al. 5 Science 230:1350 (1985). In this method, 
primers complementary to opposite end portions of the selected sequence(s) are 
used to promote, in conjunction with thermal cycling, successive rounds of 
primer-initiated replication. The amplified sequence(s) may be readily identified 
by a variety of techniques. This approach is particularly useful for detecting the 
presence of low-copy sequences in a polynucleotide-containing sample, e.g., for 
detecting pathogen sequences in a body- fluid sample. However, a nonselective 
PCR strategy will amplify both mutant and wild-type alleles with approximately 
equal efficiency, resulting in low abundance mutant alleles comprising only a 
small fraction of the final product. If the mutant sequence comprises less than 
25% of the amplified product, it is unlikely that DNA sequencing will be able to 
detect the presence of such an allele. Although it is possible to accurately quantify 
low abundance mutations by first separating the PCR products by cloning and 
subsequently probing the clones with allele-specific oligonucleotides (ASOs) 
(Saiki et al., "Analysis of Enzymatically Amplified Beta-Globin and HLA-DQ 
Alpha DNA with Allele-Specific Oligonucleotide Probes," Nature , 
324(6093): 163-6 (1986); Sidransky et al., "Identification of Ras Oncogene 
Mutations in the Stool of Patients with Curable Colorectal Tumors," Science . 
256:102-5 (1992); and Brennan et al., "Molecular Assessment of 
Histopathological Staging in Squamous-Cell Carcinoma of the Head And Neck," 
N. Engl. J. Med. . 332(7):429-35 (1995)), this approach is time consuming. In 
contrast, allele-specific PCR methods can rapidly and preferentially amplify 
mutant alleles. For example, multiple mismatch primers have been used to detect 
H-ras mutations at a sensitivity of 1 mutant in 10 5 wild-type alleles (Cha et al., 
"Mismatch Amplification Mutation Assay (MAMA): Application to the C-H-Ras 
Gene," PCR Methods App l.. 2(1): 14-20 (1992)) and claims as high as 1 mutant in 
10 6 wild-type alleles have been reported (Haliassos et al., "Detection of Minority 
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Point Mutations by Modified PCR Technique: A New Approach for a Sensitive 
Diagnosis of Tumor-Progression Markers," Nucleic Acids Res. . 17:8093-9 (1989); 
and Chen et al., "A Nonradioactive, Allele-Specific Polymerase Chain Reaction 
for Reproducible Detection of Rare Mutations in Large Amounts of Genomic 
DNA: Application to Human K-Ras," Anal. Biochem. , 244: 191-4 (1 997)). 
However, careful evaluation suggests these successes are limited to allele-specific 
primers discriminating through 3' purine-purine mismatches. For the more 
common transition mutations, the discriminating mismatch on the 3' primer end 
(i.e., G:T or C:A mismatch) will be removed in a small fraction of products by 
polymerase error during extension from the opposite primer on wild-type DNA. 
Thereafter, these error products are efficiently amplified and generate false- 
positive signal. One strategy to eliminate this polymerase error problem is to 
deplete wild-type DNA early in PCR. 

Several investigators have explored selective removal of wild-type 
DNA by restriction endonuclease (RE) digestion in order to enrich for low 
abundance mutant sequences. These RFLP methods detect approximately 1 
mutant in 10 6 wild-type or better by combining the sensitivity of polymerase with 
the specificity of restriction endonucleases. One approach has used digestion of 
genomic DNA followed by PCR amplification of the uncut fragments (RFLP- 
PCR) to detect very low-level mutations within restriction sites in the H-ras and 
p53 genes (Sandy et al., "Genotypic Analysis of Mutations in Taq I Restriction 
Recognition Sites by Restriction Fragment Length Polymorphism/Polymerase 
Chain Reaction," Proc. Natl. Acad. Sci. USA . 89:890-4 (1992); and Pourzand et 
al., "Genotypic Mutation Analysis by RFLP/PCR," Mutat. Res. . 88(1):1 13-21 
(1993)). Similar results have been obtained by digestion following PCR and 
subsequent amplification of the uncleaved DNA now enriched for mutant alleles 
(PCR-RFLP) (Kumar et al., "Oncogene Detection at the Single Cell Level," 
Oncogene , 3(6):647-51 (1988); Kumar et al., "Designed Diagnostic Restriction 
Fragment Length Polymorphisms for The Detection of Point Mutations in Ras 
Oncogenes," Oncogene Res. , 4(3):235-41 (1989); and Jacobson et al., "A Highly 
Sensitive Assay for Mutant Ras Genes and its Application to the Study of 
Presentation and Relapse Genotypes in Acute Leukemia," Oncogene , 9(2):553-63 
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(1994)). Although sensitive and rapid, RFLP detection methods are limited by the 
requirement that the location of the mutations must coincide with restriction 
endonuclease recognition sequences. To circumvent this limitation, primers that 
introduce a new restriction site have been employed in "primer-mediated" RFLP 
(Jacobson et al., "Rapid, Nonradioactive Screening for Activating Ras Oncogene 
Mutations Using PCR-Primer Introduced Restriction Analysis (PCR-PIRA)," PCR 
Methods Appl. . 1(4):299 (1992); Chen et al., "A Method to Detect Ras Point 
Mutations in Small Subpopulations of Cells," Anal. Biochem. , 195(l):51-6 
(1991); Di Giuseppe et al., "Detection of K-Ras Mutations in Mucinous Pancreatic 
Duct Hyperplasia from a Patient with a Family History of Pancreatic Carcinoma," 
Am. J. Pathol. . 144(5):889-95 (1994); Kahn et al., "Rapid and Sensitive 
Nonradioactive Detection of Mutant K-Ras Genes Via 'Enriched 1 PCR 
Amplification," Oncogene . 6:1079-83 (1991); Levi et al., "Multiple K-Ras Codon 
12 Mutations in Cholangiocarcinomas Demonstrated with a Sensitive Polymerase 
Chain Reaction Technique," Cancer Research . 51(July):3497-502 (1991); and 
Mitsudomi et al. ? "Mutations of Ras Genes Distinguish a Subset of Non-Small- 
Cell Lung Cancer Cell Lines from Small-Cell Lung Cancer Cell Lines," 
Oncogene . 6(8): 1353-62 (1991)). However, subsequent investigators have 
demonstrated that errors are produced at the very next base by polymerase 
extension from primers having 3' natural base mismatches (Hattori et al., 
"Mismatch PCR RFLP Detection of DRD2 Ser31 ICys Polymorphism and 
Schizophrenia," Biochem. Biophys. Res . Commun.. 202(2):757-63 (1994); O'Dell 
et al., "PCR Induction of a Taql Restriction Site at Any CpG Dinucleotide Using 
Two Mismatched Primers (CpG-PCR)," Genome Res. . 6(6):558-68 (1996); and 
Hodanova et al., "Incorrect Assignment of N370S Mutation Status by Mismatched 
PCR/RFLP Method in Two Gaucher Patients," J. Inherit. Metab. Pis. . 20(4):61 1-2 
(1997)). Such templates fail to cleave during restriction digestion and amplify as 
false-positives that are indistinguishable from true positive products extended 
from mutant templates. 

Use of nucleotide analogs may reduce errors resulting from 
polymerase extension and improve base conversion fidelity. Nucleotide analogs 
that are designed to base-pair with more than one of the four natural bases are 
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termed "convertides." Base incorporation opposite different convertides has been 
tested (Hoops et al., "Template Directed Incorporation of Nucleotide Mixtures 
Using Azole-Nucleobase Analogs," Nucleic Acids Res. . 25(24):4866-71 (1997)). 
For each analog, PCR products were generated using Taq polymerase and primers 
containing an internal nucleotide analog. The products generated showed a 
characteristic distribution of the four bases incorporated opposite the analogs. Of 
significance, these products retained the original sequence at all natural base 
positions. Convertides readily form degenerate amplification products by virtue of 
their ability to assume different hydrogen bonding patterns through tautomeric 
shift (Brown et al., "Synthesis and Duplex Stability of Oligonucleotides 
Containing Adenine-Guanine Analogues," Carbohydrate Research , 216:129-39 
(1991)), bond rotation (Bergstrom et al., Nucleosides and Nucleotides . 
15(l-3):59-68 (1996)), or base stacking (Bergstrom et al., Journal of the American 
Chemical Society . 1 17:1201-9 (1995); and Zhang et al., "Exploratory Studies on 
Azole Carboxamides as Nucleobase Analogs: Thermal Denaturation Studies on 
Oligodeoxyribonucleotide Duplexes Containing Pyrrol e-3-Carboxamide," Nucleic 
Acids Res. . 26:2208-15 (1998)). Thus, PCR primers containing convertides may 
be used to facilitate base conversion. In principle, using the 6H,8H-3,4- 
dihydropyrimido[4,5-c][l,2]oxazine-7-one analog Qe), which is known to exhibit 
both the C-like and T-like tautomeric forms at the 3 ' end of the primer (Brown et 
al., "Synthesis and Duplex Stability of Oligonucleotides Containing Adenine- 
Guanine Analogues," Carbohydrate Research . 216:129-39 (1991)), a C-G base- 
pair may be converted to a T-A base pair (Figure 1). Due to the better geometry, 
DNA polymerases may "read," or extend better, from a Q6-G pair than a T-G 
mismatch (wobble base pair). Similarly, DNA polymerases may "write," or 
incorporate both G and A bases opposite Qe (Hill et al., "Polymerase Recognition 
of Synthetic Oligodeoxyribonucleotides Incorporating Degenerate Pyrimidine and 
Purine Bases," Proc. Natl. Acad. Sci. USA . 95(8):4258-63 (1998)), whereas A is 
always inserted opposite a T base. Thus, the Qe analog primer serves as an 
intermediary, providing a "preconversion" step before a natural base primer is 
added to selectively amplify the desired product from the degenerate pool. While 
nucleotide analogs have great potential, they have not been tested in high 
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sensitivity assays. There is a need for a method that optimizes the fidelity of the 
analog conversion in the PCR process. 

Optimization of PCR/RE/LDR 

As discussed above, PCR used with a high fidelity conversion 
process would provide a valuable method for the amplification of mutant gene 
sequences. By designing primers with one or more mismatches, mutant DNA 
template can be efficiently extended, while poor extension is achieved on wild- 
type DNA template. However, once these primers extend with or without a 
mismatch, the products thereafter are perfect matches for the primer in subsequent 
PCR cycles. Thus, false positive signals are amplified in subsequent cycles. 
Moreover, PCR error can generate a base change in the template, which perfectly 
matches the primer. AS-PCR can detect pyrimidine<-»purine transversions at 
sensitivities of 1 in 10 5 (Newton et al., "Analysis of Any Point Mutation in DNA. 
The Amplification Refractory Mutation System (ARMS)," Nucleic Acids Res. . 
17(7):2503-16 (1989); and Tada et al., "Detection of Ras Gene Mutations in 
Pancreatic Juice and Peripheral Blood of Patients with Pancreatic 
Adenocarcinoma," Cancer Res. , 53(1 1):2472-4 (1993)). Nevertheless, the 
majority of cancer-associated mutations are C<-VT and A«-»G transitions, as, for 
example, are over 80% of p53 point mutations (de Fromentel et al., Genes 
Chromosomes Cancer , 4(1): 1-15 (1992)). A DNA diagnostic method is needed to 
accurately quantify this type of low abundance mutation. 

The ligation detection reaction (LDR) in conjunction with PCR has 
been used to quantify small amounts of PCR extension product. LDR uses two 
adjacent primers and a thermostable ligase to distinguish all four bases potentially 
found at any position in a DNA sequence (Barany, F., "Genetic Disease Detection 
and DNA Amplification Using Cloned Thermostable Ligase," Proc. Natl. Acad. 
Sci. USA , 88:189-93 (1991); Barany, F., "The Ligase Chain Reaction in a PCR 
World," PCR Methods App l.. 1:5-16 (1991); Day et al., "Detection of Steroid 21- 
Hydroxylase Alleles Using Gene-Specific PCR and a Multiplexed Ligation 
Detection Reaction," Genomics , 29:152-62 (1995); and Khanna et al., Oncogene , 
18:27-38 (1999)). Thermostable ligase demonstrates the highest fidelity when the 
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discriminating base is located at the 3 5 end of the upstream primer (Luo et al., 
"Improving the Fidelity of Thermus Thermophilus DNA Ligase," Nucleic Acids 
Res. , 24(15):3071-8 (1996)). PCR/LDR (PCR of a sequence from genomic DNA 
followed by LDR) can detect mutations with a sensitivity of approximately 1 
mutant allele in 4,000 normal alleles (Khanna et al., Oncogene . 18:27-38 (1999)). 
Sensitivity of approximately 1 in 1 0 6 has been achieved by combining PCR with 
restriction endonuclease digestion of wild-type DNA (Sandy et al., Proc. Natl. 
Acad. Sci. USA . 89:890-4 (1992); and Pourzand et al., "Genotypic Mutation 
Analysis by RFLP/PCR." Mutat. Res. . 288(1^:1 13-21 (1993)). Mutations 
occurring within the restriction site prevent cleavage of the mutant allele, while 
wild-type alleles bearing canonical restriction site sequence are depleted. As a 
result, subsequent PCR cycles preferentially amplify mutant DNA. If a mutation 
site is not within an endonuclease recognition site present in wild-type DNA, a 
restriction site must be introduced. This is typically done by PCR using a primer 
or primers with mismatched bases. Mutations cannot be detected in any portion of 
the restriction site spanned by the primers, since those bases are introduced 
directly through the primers. In a random DNA sequence, over 20% of bases are 
contained within a preexisting four-base restriction site and 60% of bases are 
within a four-base subsequence that can be converted into a restriction site by a 
single base change. In these small sites, 3 ' terminal base mismatch primers must 
frequently be used. While conceptually straightforward, 3' mismatch extension 
has proven to be difficult (Newton et al., "Analysis of Any Point Mutation in 
DNA. The Amplification Refractory Mutation System (ARMS)," Nucleic Acids 
Res. . 17(7):2503-16 (1989); Kwok et al., "Effects of Primer-Template Mismatches 
on the Polymerase Chain Reaction: Human Immunodeficiency Virus Type 1 
Model Studies." Nucleic Acids Res . 18(4):999-1005 (1990); O'Dell et al., 
Genome Res. . 6(6):558-68 (1996); and Day et al.. Nucleic Acids Res. . (1999)). 
The introduction of interrupted palindromic restriction sites has been more 
successful using internal mismatch primers spanning one half-site through the 
intervening bases up to the other half-site (Kumar et al., "Oncogene Detection at 
the Single Cell Level," Oncogene , 3(6):647-51 (1988); and Anderson et al., 
"Prevalence of RAS Oncogene Mutation in Head and Neck Carcinomas," L_ 
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OtolaryngoL . 21(5):321-6 (1992)). Several perfectly matched bases stabilize the 
3 5 end of the mismatch primer. However, this approach may be used only if the 
second half-site is present naturally in wild-type DNA. Mutations in the second 
half-site prevent digestion. Only mutations occurring at bases within the 
recognition sequence are detectable by RFLP methods. Mutations occurring at 
bases outside a preexisting restriction site in wild-type DNA may be detected by 
introducing a new restriction site containing that base. 

Restriction endonucleases recognizing interrupted palindromes are 
less abundant than endonucleases recognizing contiguous four- and six-base sites. 
Multiple base changes would often be required to introduce an interrupted 
palindrome restriction site to identify mutations at any base. 

More recently, methods of identifying known target sequences by 
probe ligation methods have been reported. U.S. Patent No. 4,883,750 to N.M. 
Whiteley, et al.; D.Y. Wu, et al., Genomics 4:560 (1989), U. Landegren, et aL, 
Science 241:1077 (1988), and E. Winn-Deen, et aL. Clin. Chem. 37:1522 (1991). 
In one approach, known as oligonucleotide ligation assay ("OLA"), two probes or 
probe elements, that span a target region of interest, are hybridized to the target 
region. Where the probe elements base pair with adjacent target bases, the 
confronting ends of the probe elements can be joined by ligation, e.g., by treatment 
with ligase. The ligated probe element is then assayed, evidencing the presence of 
the target sequence. 

In a modification of this approach, the ligated probe elements act as 
a template for a pair of complementary probe elements. With continued cycles of 
denaturation, hybridization, and ligation in the presence of pairs of probe 
elements, the target sequence is amplified linearly, allowing very small amounts of 
target sequence to be detected and/or amplified. This approach is referred to as 
ligase detection reaction. When two complementary pairs of probe elements are 
utilized, the process is referred to as the ligase chain reaction, which achieves 
exponential amplification of target sequences. F. Barany, "Genetic Disease 
Detection and DNA Amplification Using Cloned Thermostable Ligase," Proc. 
Nat'l Acad. Sci. USA . 88:189-93 (1991) and F. Barany, "The Ligase Chain 
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Reaction (LCR) in a PCR World," PCR Methods and Applications , 1 :5-16 (1991); 
Barany, U.S. Patent Application filed March 19, 1999, S/N 60/125,251. 
Techniques, such as PCR/LDR, that rely on mutant enrichment require 
optimization of reaction conditions in order to minimize random PCR errors. 
These errors would be indistinguishable from mutations originally present in 
clinical samples. One source of error-minimization may be found in optimization 
of the buffer conditions for PCR. Standard PCR buffers contain Tris, however the 
pKa of Tris is strongly dependent on temperature. A PCR reaction containing Tris 
pH 8.3 (measured at 23°C) is approximately pH 7 near 65°C (the extension 
temperature), and drops to approximately pH 6 near 95°C (the template melting 
temperature). PCR error can result from template degradation and polymerase 
misincorporation. Template degradation occurs during periods of high 
temperature and low pH in each PCR cycle and limits product size in "long" PCR 
(Barnes, "PCR Amplification of up to 35-Kb DNA with High Fidelity and High 
Yield from Lambda Bacteriophage Templates," Proc. Natl. Acad. Sci. USA , 
91(6):2216-20 (1994); Cheng et al., "Effective Amplification of Long Targets 
from Cloned Inserts and Human Genomic DNA," Proc. Natl. Acad. Sci. USA . 
91(12):5695-9 (1994); and Sang et al., "Generation of Site-Directed Mutagenesis 
by Extralong, High-Fidelity Polymerase Chain Reaction," Anal. Biochem. . 
233(1): 142-4 (1996)). Raising the buffer pH in long PCR (using Tris 9.1) reduces 
the amount of template cleavage and increases PCR efficiency (Barnes, "PCR 
Amplification of up to 35-Kb DNA with High Fidelity and High Yield from 
Lambda Bacteriophage Templates," Proc. Natl. Acad. Sci. USA , 91(6):2216-20 
(1994)). Although the efficiency of long PCR increases with higher pH, the level 
of mutations within these PCR products may also increase since high pH 
decreases the fidelity of Taq and Pfu polymerases (Eckert et al., "High Fidelity 
DNA Synthesis by the Thermus Aquaticus DNA Polymerase," Nucleic Acids 
Res. , 18(13):3739-44 (1990); Eckert et al., "DNA Polymerase Fidelity and the 
Polymerase Chain Reaction," PCR Methods Appl. . l(l):17-24 (1991); and Cline 
et al., "PCR Fidelity of Pfu DNA Polymerase and Other Thermostable DNA 
Polymerases." Nucleic Acids Res. . 24(18^:3546-51 (1996)). Use of alternative 
PCR buffers with lower | ApKa | can improve polymerase fidelity and still reduce 
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template damage by maintaining more neutral pH over a wider temperature range 
(Eckert et al., "DNA Polymerase Fidelity and the Polymerase Chain Reaction," 
PCR Methods Appl. . l(l):17-24 (1991); and Brail et al., "Improved Polymerase 
Fidelity in PCR-SSCPA," Mutat. Res. . 303(4):171-5 (1993)). The addition of 
glycerol or formamide may reduce mutations arising from template damage during 
PCR cycling and may help avoid misextension from mispaired primers (Bottema 
et al., "PCR Amplification of Specific Alleles: Rapid Detection Of Known 
Mutations and Polymorphisms," Mutat. Res. . 288(1):93-102 (1993); and Cha et 
al., "Mismatch Amplification Mutation Assay (MAMA): Application to the C-H- 
Ras Gene," PCR Methods Appl.. 2(l):14-20 (1992)). 

Thus, there is a need to improve buffer reaction conditions 
currently used in PCR, in order to minimize the opportunity for mismatches 
caused by PCR error. Increased analog conversion fidelity, alone, will not solve 
the need for an improved method of mutant DNA detection. In addition, there is a 
need to optimize PCR reaction conditions to decrease random PCR error, and 
finally, a method is needed that provides sensitive detection for the PCR extension 
products. 

SUMMARY OF THE INVENTION 

The present invention relates to a method for identifying one or 
more low abundance sequences differing by one or more single-base changes, 
insertions, or deletions, from a high abundance sequence, in a plurality of target 
nucleotide sequences. This method involves a first polymerase chain reaction 
phase, a second polymerase chain reaction phase, and a restriction endonuclease 
digest reaction phase, followed by a third polymerase chain reaction phase and a 
ligase detection reaction phase. 

The starting sample of the present invention is a sample potentially 
containing one or more low abundance target nucleotide sequences with at least 
one sequence difference each from the high abundant target sequences present in 
the sample. 
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In the first polymerase chain reaction phase, a primary 
oligonucleotide primer set is provided. The primary oligonucleotide primer set 
has a first oligonucleotide primer containing a target-specific portion, and a 
second oligonucleotide primer containing a target-specific portion. The primary 
oligonucleotide primers are suitable for hybridization on complementary strands 
of a corresponding high and low abundance target nucleotide sequences to permit 
formation of a polymerase chain reaction product. However, the primers each 
have a mismatch which interferes with formation of such a polymerase chain 
reaction product when hybridized to any other nucleotide sequence present in the 
sample. The primary oligonucleotide primers, the sample, and a polymerase are 
blended to form a primary polymerase chain reaction mixture. 

The primary polymerase chain reaction mixture is subjected to two 
or more polymerase chain reaction cycles involving a denaturation treatment, a 
hybridization treatment, and an extension treatment. The denaturing treatment 
separates hybridized nucleic acid sequences. The hybridization treatment causes 
the target-specific portions of the primary oligonucleotide primers to hybridize to 
the target nucleotide sequences. The extension treatment causes the hybridized 
primary oligonucleotide primers to be extended to form primary extension 
products complementary to the target nucleotide sequence to which the primary 
oligonucleotide primer is hybridized. 

Next, there is a second polymerase chain reaction phase. This 
phase involves providing a secondary oligonucleotide primer set having a first 
oligonucleotide primer with a target- specific portion and a 5* upstream secondary 
primer-specific portion and a second oligonucleotide primer with a target-specific 
portion and a 5' upstream secondary primer- specific portion. The secondary 
oligonucleotide primers in a particular set are suitable for hybridization on 
complementary strands of the primary extension products to permit formation of a 
secondary polymerase chain reaction product which contains or creates a 
restriction endonuclease recognition site when amplifying the high abundance 
target, but does not contain or create a restriction endonuclease recognition site 
when amplifying the one or more low abundance targets. The primary extension 
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products, the secondary oligonucleotide primers, and the polymerase are blended 
to form a secondary polymerase chain reaction mixture. 

The secondary polymerase chain reaction mixture is subjected to 
two or more polymerase chain reaction cycles comprising a denaturation 
treatment, a hybridization treatment, and an extension treatment. The denaturation 
treatment involves separation of hybridized nucleic acid sequences. In the 
hybridization treatment, the secondary oligonucleotide primers hybridize to the 
primary extension products. The extension treatment causes the hybridized 
primary extension products to form secondary extension products complementary 
to the primary extension products. The high abundance secondary extension 
products contain a restriction site but the low abundance secondary extension 
products do not. 

The next phase involves blending a restriction endonuclease with 
the secondary extension products to form an endonuclease digestion reaction 
mixture. The restriction endonuclease is one that cleaves, with specificity, the 
restriction endonuclease recognition site within or created when amplifying the 
high abundance target, but not the low abundance target in the secondary 
extension products. The restriction endonuclease digestion selectively destroys 
the high abundance secondary extension products. 

Next, there is a third polymerase chain reaction phase. This 
involves providing a tertiary oligonucleotide primer set having a first tertiary 
primer containing the same sequence as the 5 f upstream portion of the first 
oligonucleotide primer of the secondary oligonucleotide primer set and a second 
tertiary primer containing the same sequence as the 5 1 upstream portion of a 
second oligonucleotide primer of the secondary oligonucleotide primer set. The 
set of tertiary oligonucleotide primers may be used to amplify all of the secondary 
extension products. The secondary extension products are blended with the 
tertiary oligonucleotide primer set, and a polymerase to form a tertiary polymerase 
chain reaction mixture. 

The tertiary polymerase chain reaction mixture is subjected to two 
or more polymerase chain reaction cycles comprising a denaturation treatment, a 
hybridization treatment, and an extension treatment. The denaturation treatment 
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causes the hybridized nucleic acid sequences to be separated, while the 
hybridization treatment involves hybridization of the tertiary oligonucleotide 
primers to hybridize to the secondary extension products. During the extension 
treatment, the hybridized tertiary oligonucleotide primers are extended to form 
tertiary extension products complementary to the uncleaved secondary extension 
products. 

Next, the tertiary extension products are subjected to a ligase 
detection reaction. This involves providing a plurality of oligonucleotide probe 
sets, each set having a first oligonucleotide probe, having a tertiary extension 
product-specific portion and a detectable reporter label, and a second 
oligonucleotide probe, having a tertiary extension product-specific portion. The 
oligonucleotide probes in a particular LDR probe set are suitable for ligation when 
hybridized adjacent to one another on a complementary tertiary extension product- 
specific portion. However, there is a mismatch which interferes with such ligation 
when hybridized to any other nucleotide sequence present in the sample. 

The tertiary extension products, the plurality of oligonucleotide 
probe sets, and a ligase are blended to form a ligase detection reaction mixture. 

The ligase detection reaction mixture is subjected to one or more 
ligase detection reaction cycles having a denaturation treatment, and a 
hybridization treatment. The denaturation treatment involves separation of 
hybridized oligonucleotides from the tertiary extension products. In the 
hybridization treatment, the oligonucleotide probe sets hybridize at adjacent 
positions in a base-specific manner to their respective tertiary extension products, 
if present. As a result, adjacent probes ligate to one another to form a ligation 
product sequence containing the detectable reporter label and the tertiary extension 
product-specific portions connected together. The oligonucleotide probe sets may 
hybridize to nucleotide sequences other than their respective complementary 
tertiary extension products but do not ligate together due to a presence of one or 
more mismatches and individually separate during the denaturation treatment. 
Following the ligase detection reaction cycles, the reporter labels of the ligation 
product sequences are detected which indicates the presence of one or more low 
abundance target nucleotide sequences in the sample. 
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The present invention also relates to a kit for identifying one or 
more low abundance sequences differing by one or more single-base changes, 
insertions, or deletions, from a high abundance sequence in a plurality of target 
nucleotide sequences. This kit provides a primary oligonucleotide primer set, a 
secondary oligonucleotide primer set, a tertiary oligonucleotide primer set, and a 
plurality of oligonucleotide probe sets. 

The primary oligonucleotide primer set provided in the kit has (a) a 
first oligonucleotide primer containing a target-specific portion, and (b) a second 
oligonucleotide primer containing a target-specific portion. The primary 
oligonucleotide primers are suitable for hybridization on complementary strands 
of a corresponding high and low abundance target nucleotide sequences to permit 
formation of a primary extension product. However, the primers have a mismatch 
which interferes with formation of such a polymerase chain reaction product when 
hybridized to any other nucleotide sequence present in the sample. 

The secondary oligonucleotide primer set provided in the kit has (a) 
a first oligonucleotide primer, containing a target-specific portion and a 5 1 
upstream secondary primer-specific portion, and (b) a second oligonucleotide 
primer, containing a target- specific portion and a 5 f upstream secondary primer- 
specific portion. The secondary oligonucleotide primers are suitable for 
hybridization on complementary strands of the primary extension products to 
permit formation of a secondary extension product which contains or creates a 
restriction endonuclease recognition site when amplifying the high abundance 
target, but does not contain or create a restriction endonuclease recognition site 
when amplifying the one or more low abundance targets. 

The tertiary oligonucleotide primer set provided in the kit have (a) 
a first tertiary primer containing the same sequence as the 5 1 upstream portion of 
the first oligonucleotide primer of the secondary oligonucleotide primer set, and 
(b) a second tertiary primer containing the same sequence as the 5' upstream 
portion of a second oligonucleotide primer of the secondary oligonucleotide 
primer set. The set of tertiary oligonucleotide primers may be used to amplify all 
of the secondary extension products. 
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The kit also provides a plurality of oligonucleotide probe sets. 
Each set has (a) a first oligonucleotide probe, containing a tertiary extension 
product-specific portion and a detectable reporter label, and (b) a second 
oligonucleotide probe, containing a tertiary extension product-specific portion. 
The oligonucleotide probes in a particular set are suitable for ligation together 
when hybridized adjacent to one another on a complementary tertiary extension 
product-specific portion. However, the probes have a mismatch which interferes 
with such ligation when hybridized to any other nucleotide sequence present in the 
sample. 

There is not currently available a biological technique that is 
sensitive enough to detect and identify one or more low abundance sequences 
differing by one or more single-base changes, insertions, or deletions, from a high 
abundance sequence, in a plurality of target nucleotide sequences, without the 
potential for many errors during the amplification and detection stages. The 
present invention is directed to overcoming these and other deficiencies in the art, 
by providing a method which optimizes the conditions for selective amplification 
of low abundance DNA targets through polymerase chain reaction and restriction 
endonuclease digestion, and a method of nucleotide detection, that combined, 
creates a highly sensitive method for the amplification and detection of low 
abundance nucleotides sequences such as mutant genes. This procedure is useful 
for clinical and research applications. 

BRIEF DESCRIPTION OF THE DRAWINGS 

Figur e F igures 14s A-C are a schematic drawing of the coupled 
polymerase chain reaction-restriction endonuclease digestion-ligase detection 
reaction process of the present invention. 

Figure 2 shows the detection of ligation products resulting from the 
process of Figur e 1 . F igures 1A-C. using gel electrophoresis. 

Figure 3 shows the detection of ligation products resulting from the 
process of Figur e F igures 1 A-C using an addressable array. 
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Figure 4 shows a conversion facilitated by nucleotide analog 
preconversion. A C:G base pair in a sequence is targeted for conversion to a T:A 
basepair. Rather than using a 3' natural base mismatch primer to attempt direct 
conversion, a nucleotide analog (Qe) primer is used for preconversion. The Qe 
analog reads the G base well and allows polymerase to efficiently extend from the 
3 ' primer. During PCR, the reverse primer anneals to the Q6 PCR product and 
is extended by polymerase to synthesize the opposite strand. When polymerase 
reaches the Q6 analog in the template, polymerase writes A (or G, not shown) 
opposite the analog and continues synthesis of the strand. After a few cycles, a 
pool of products is made with degenerate sequence opposite the analogs. A 
natural base primer is then added to selectively amplify the products having the 
desired base change. 

Figure 5A-I shows the nucleotide analogs used in PCR primers. In 
the final deprotected oligonucleotide, the name of the nucleoside containing the 
base analog shown are as follows: Qi, l-(2'-deoxy-p-D-ribofiiranosyl)imidazole- 
4-carboxamide (Figure 5A); Q2, l-(2 , -deoxy-P-D-ribofuranosyl)-3-nitropyrrole 

(Figure 5B); Qs, 2'-deoxyinosine (Figure 5C); Qe, 6-(2'-deoxy-P-D- 
ribofuranosyl)-6H,8H-3 ? 4-dihydropyrimido[4,5-c][l,2]oxazine-7-one (Figure 5D); 
Q7, 2-amino-7-(2'-deoxy-P-D-ribofuranosyl)-6-methoxyaminopurine (Figure 5E); 
Q16, 1 -(2 l-deoxy-p-D-ribofuranosyl)-4-iodopyrazole (Figure 5F); Q18, l-(2'- 
deoxy-P-D-ribofuranosyl)pyrrole-3-carboxamide (Figure 5G); Qi9, l-(2'-deoxy- 
P-D-ribofuranosyl)-4-nitropyrazole (Figure 5H). Base analogs (Q) are attached to 
the 1 ' position of deoxyribofuranose. The nucleoside analogs are attached to the 
controlled pore glass (CPG) column via a succinyl linker (R = linker to CPG). 
The oligonucleotide is synthesized from the 5' hydroxyl after removal of the 
dimethoxytrityl (DMT) protecting group, placing the analog at the 3 ' end. After 
cleavage from the CPG column and deprotection, the oligonucleotide is extended 
by polymerase from the 3' base analog hydroxyl group (R = H) (Figure 51). 

Figures 6A-C show the primers used in mismatch extension and 3' 
nucleotide analog conversion. Complementary (- strand) sequences are shown in 
reverse orientation (3-5'), e.g., reverse strand primers (names ending in "R"). In 
Figure 6A, one of nine different synthetic 50 bp duplex templates (- strand. SEP. 



-21 - 



ID. No. 1 : + strand. SEP. TP. No. 2^ is shown melted with primers aligned to 
complementary sequence. Primer extension was performed using 3 ' natural base 
and nucleotide analog primers (p53-248X (SEP. TP. No. 3^ and p53-248X R (SEP. 
IP. No. 4 ). Some extension products were reamplified using truncated zipcode 
primers p53zip248 (SEP. TP. No. 5) and P 53zip248R (SEP. IP. No. 6^ and 
sequenced using one of the zipcode primers (Ztop (SEP. IP. No. 7} or Zbot ( SEP. 
IP. No. 8 V In Figure 6B, preconversion was performed on nine different 50 bp 
synthetic duplex templates (only one tem plate. SEP. IP. Nos. 1 and 2 is shown ^ 
using 3' nucleotide analog primers, e.g., p53-248Q6 (SEP. IP. No. 3^ and p53- 
248Q6R (SEP. IP. No. 4V Conversion, with or without preconversion, was 
performed using primers containing 3' natural base, e.g. primers p53zip248T 
(SEP. IP. No. y> and P 53zip248T R (SEP. IP. No. 1 OV These conversion 
products were reamplified using zipcode primers and identified by LPR. In 
Figure 6C 5 LPR probe sets were designed to identify specific base changes in 
conversion products. LPR probes anneal in competition with each other to 
conversion products. Perfectly complementary upstream and downstream LPR 
probes with no overlap or gap ligate with high specificity. Piscriminating probes 
had different length 5' tails to allow specific product separation on an acrylamide 
gel. A set of probes used to identify PCR error products in nonconversion of wild- 
type template is shown in Figure 6C: P 53LPR248FCA (SEP. TP. No. 1 1). 
p53LPR248FCG (SEP. IP. No. 12V p53LDR248FCT (SEP. IP. No. 13V 
p53LPR248FCC (SEP. IP. No. 14V and p 53LPR24 8PGG (SEP. IP. No. 15V and 
potential conversion product (SEP. IP. No. I6V 

Figures 7A-B show results of a conversion by natural base and Qe 
convertide. Conversion products from nine templates were detected by 
PCR/LPR. Each template was a 50-base pair synthetic duplex PNA of identical 
sequence except for the central four bases which have the sequence indicated. As 
shown in Figures 7A-B. temp late CCGG refers to SEP. IP. No. 2 fas shown in 
Figure 6A-BV template CTGG refers t o SEP. TP. No. 1 7. temp late CGGG refers 
to SEP. IP. N o. 1 8. tem plate CAGG refers to SEP. IP. No. 1 9_ template TCGA 
refers to SEP. IP. No. 20. template GCGC refers to SEP. IP. No. 21. temp late 
ACGT refers to SEP. IP. No. 22. temp late CATG refers to SEP. TP. No. 23. and 
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template CGCG refers to SEP. TP. No. 24. Conversion occurred within these four 
bases. The expected conversion products produced by starting with the 
conversion primers having the indicated 3 ' natural base or convertide are shown. 
Figure 7 A shows the conversion of the first base to C using primer p53zip248C 
fSEP. TP. No. 25> with and without preconversion. Figure 7B shows the 
conversion of the first base to T using primer p53zip248T (SEP. IP. No. 9^ with 
and without Q6 preconversion. 

Figures 8A-B shows conversion by natural base, Qs, and Q7 
convertides. Conversion products from nine templates , corresponding to SEP. TP. 
Nos. 2 and 1 7-24. were detected by PCR/LPR. Each template was a 50-base pair 
synthetic duplex PNA of identical sequence except for the central four bases 
which have the sequence indicated. Conversion occurred within these four bases. 
The expected conversion products produced by starting with the conversion 
primers having the indicated 3 ' natural base or convertide are shown. Figure 8 A 
shows conversion of the first base to G using primer p53zip248G f SEP. IP. 
No. 26^ w ith and without Qs or Q7 preconversion. Figure 8B shows conversion of 
the first base to A using primer p53zip248 A fSEP. IP. No. 27^ w ith and without 
Q5 or Q7 preconversion. 

Figures 9A-H show the fidelity of polymerase extension. Primer 
slippage accounts for many of the observed products of extension (Figure 7 and 
Figure 8). In Figure 9 A, perfectly complementary primer gives correct product. 
For Figure 9B, a T:G mismatch at the second base explains TGGA (or TGCA) 
product. In Figure 9C, extension from a Qe:G pairing with no slippage on the 
minus strand of the CCGG template (followed by 3 ' T conversion primers) 
resulted in the expected TCGA product. Figure 9P shows extension from a Qe:G 
pairing with no slippage on the minus strand of the CTGG template and several 
other templates (followed by 3' T conversion primers) resulted in the expected 
products. In Figure 9E, GG mismatch extension apparently gave the expected GC 
product on one template, but perhaps only fortuitously (see Figure 9F). For Figure 
9F, all extensions from GG mismatches gave GC extension products, consistent 
with a GT mismatch formed by slippage at the preceding base. In Figure 9G, 
Qs:G and Q?:G extension products apparently gave the expected GC product on 
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one template, but perhaps only fortuitously (see Figure 9H). In Figure 9H 5 all 
extensions from Qs:G and Q?:G mismatches (followed by 3' G conversion 
primers) gave GC extension products consistent with a Qs'.T or Q?:T mismatch at 
the preceding base. 

Figure 10A-C is a schematic of the amplification of p53 exon 7 
from genomic DNA (Figure 1 OA), and conversion of the normal Mspl site to a 
Taql site. First, the template is subjected to PCR amplification of the p53 Exon 7 
(Figure 10B). Next, a PCR conversion step is carried out, resulting in the 
conversion of codon 248 to a the to create a (TCGA) Taql restriction site (Figure 
10B). 

Figures 1 1 A-C show the PCR primers used to determine 
polymerase fidelity by PCR/RE/LDR at a preexisting restriction site to avoid 
conversion error. The method used is as described in Figure 6 for conversion 
steps. However, in the fidelity experiments shown in Figure 1 1, the buffer 
components were modified, to test for optimal buffer conditions within the 
conversion process. Figure 1 1 A shows a PCR fidelity assay. A synthetic 50-base 

pair duplex marker template (MK) and wild-type p53 exon 7 PCR product = (- 

strand- SEP. ID. No. 1: + strand. SEP. ID. No. 21 are mixed at known ratios in 
parallel reactions. Perfect match primers p53-248short (SEQ. ID. No. 281 and 
p53-248shortR (SEP. ID. No. 291 amplify the wild-type CCGG and marker 
CGGG. Then, longer zipcode containing primers p53zip248short ( SEP. ID. No. 
51 and p53zip248short R (SEP. ID. No. 61 were added. Finally, wild-type was 
repeatedly digested and reamplified with zipcode primers (Ztop (SEP. ID. No. 71 
and Zbot (SEP. ID. No. 81 . In Figure 1 IB, preconversion was performed using 
primers containing 3' convertide; e.g., p53-248P ^ (SEP. ID. No. 31. Conversion 
of the Mspl site to a Taql site with or without preconversion was performed using 
3' natural base primers p53zip248T (SEP. ID. No. 91 a nd p53zip248T R (SEP. ID. 
No. 101 . Long primers were added as above and conversion products further 
amplified. Wild-type products were digested with the restriction endonuclease 
appropriate for the new site. Mutant products were preferentially amplified with 
zipcode primers. In Figur e 1 1 C: Pther primers in Figure 1 1 B include p53Taq248T 
(SEP. ID. No. 301. p53Taq248TR (SE P. ID. No. 311. p53Taq248P* (SEP. ID. 
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No. 32V and p53Taq248Q«R fSRO. TP. No. 4V Tn Figure 1 1C. LDR probe sets 
were designed to query the template sequence around the point of ligation. 
Perfectly hybridized upstream and downstream LDR probes with no overlap or 
gap are preferentially ligated with high specificity. Discriminating probes have 
different length 5' tails to allow specific product separation on an acrylamide gel. 
Probes shown (SEP. ID. Nos. 11-14. as described in Figure 6Q w ere used for 
identification of mutations occurring in the second base of the Mspl site (no 
conversion). An extra probe fo53T,DR248FTCT AfSKO. TP. No. 33V was used to 

compare C— >T transitions at the first base and second base of the Mspl site. A 
comparable set of discriminating and common probes were used to identify 
mutations at the second base of the Taql site in conversion products fSHO. TP. No. 
34^ had a T at the 3' penultimate base in the discrimination primers and A at the 5' 
penultimate base in the common probe. 

Figures 12A-B show the buffer and enzyme dependent PCR errors 
detected by the PCR/RE/LDR experiments shown in Figure 1 1 . The indicated 
polymerase/buffer combinations were used to amplify p53 exon 7 from genomic 
DNA. The same buffers were used in reactions with perfect match primers to 
reamplify the Mspl site. In Figure 12 A, Tag polymerase (T) was used in various 
test PCR buffers, while in Figure 12B, Vent polymerase (V) was used in various 
test buffers. Vent polymerase did not amplify p53 exon 7 from genomic DNA in 
TsK, buffer A, buffer. In this case only, two different enzyme/buffer sets were 
used for preamplification and "conversion" (not actual conversion, since perfect 
match primers were used). The Ampli7ag TYTsK exon 7 genomic DNA PCR 
product was substituted in the Vent V/TsK, buffer A, reamplification. "C" 
indicates no MK was added (control reaction). 

Figure 13 shows the results of a comparison of conversion fidelity. 
The relative intensities of conversion reaction products is indicated by color and 
the height of each peak in a 3-D plot. Marker (MK) DNA (with CGGG replacing 
the Mspl site) was added at known ratios to wild-type (WT) in parallel reactions. 
The -log(MK:WT) indicates relative fraction of MK present, e.g., 
-log(MK:WT)-3 means the ratio of MK to WT was 1 : 1000. "C" indicates no MK 
was added (control reaction). Nonconversion control reactions (C:G) were 
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performed using perfect match 3' C primers. Conversion of the Mspl site (CCGG) 
to a Taql site (TCG A) was performed using natural base 3 ' T primers with and 
without preconversion using 3' Qe nucleotide analog primers (Q6:G and T:G 
reactions, respectively). LDR products from Mspl nonconversion contain CNGG, 
and products from Taql conversion contain TNGA, but only the central bases 
(second and third bases) are indicated as _NG_. The LDR products were designed 
to separate on acrylamide gels by two base differences in size. Some 
undetermined bands of intermediate size were also observed. Lanes 1-4 were 
digested with Mspl, while lanes 5-12 were digested with Taql during 
PCR/RE/LDR. 
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DETAILED DESCRIPTION OF THE INVENTION 

The present invention relates to a method for identifying one or 
more low abundance sequences differing by one or more single-base changes, 
insertions, or deletions, from a high abundance sequence, in a plurality of target 
nucleotide sequences. This method involves a first polymerase chain reaction 
phase, a second polymerase chain reaction phase, and a restriction endonuclease 
digestion reaction phase, followed by a third polymerase chain reaction phase and 
a ligase detection reaction phase. 

The starting sample of the present invention is a sample potentially 
containing one or more low abundance target nucleotide sequences with at least 
one sequence difference each from the high abundance target sequences present in 
the sample. 

In the first polymerase chain reaction phase, a primary 
oligonucleotide primer set is provided. The primary oligonucleotide primer set 
has a first oligonucleotide primer containing a target-specific portion, and a 
second oligonucleotide primer containing a target-specific portion. The primary 
oligonucleotide primers are suitable for hybridization on complementary strands 
of a corresponding high and low abundance target nucleotide sequences to permit 
formation of a polymerase chain reaction product. However, the primers each 
have a mismatch which interferes with formation of such a polymerase chain 
reaction product when hybridized to any other nucleotide sequence present in the 
sample. The primary oligonucleotide primers, the sample, and a polymerase are 
blended to form a primary polymerase chain reaction mixture. 

The primary polymerase chain reaction mixture is subjected to two 
or more polymerase chain reaction cycles involving a denaturation treatment, a 
hybridization treatment, and an extension treatment. The denaturation treatment 
separates hybridized nucleic acid sequences. The hybridization treatment causes 
the target-specific portions of the primary oligonucleotide primers to hybridize to 
the target nucleotide sequences. The extension treatment causes the hybridized 
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primary oligonucleotide primers to be extended to form primary extension 
products complementary to the target nucleotide sequence to which the primary 
oligonucleotide primer is hybridized. 

The efficiency and accuracy of converting the high abundance 
primary polymerase chain reaction product into a secondary polymerase chain 
reaction product containing a restriction endonuclease site may be improved by 
performing a step (referred to a "preconversion") prior to providing the secondary 
oligonucleotide primer set. This step consists of providing a pre-secondary 
oligonucleotide primer set having (a) a first oligonucleotide primer, with a target- 
specific portion, and (b) a second oligonucleotide primer, with a target-specific 
portion. The target-specific portions are identical or substantially identical to the 
secondary oligonucleotide primer set but at least one primer contains one or more 
nucleotide analogs. The oligonucleotides in a particular pre-secondary primer set 
are suitable for hybridization on complementary strands of the primary extension 
products. The primary extension product is denatured, and the primary extension 
products are blended with a polymerase and the pre-secondary oligonucleotide 
primers to form a pre-secondary polymerase chain reaction mixture. The mixture 
is subjected to two or more PCR cycles to permit the formation of a pre-secondary 
polymerase chain reaction product. This product contains one or more nucleotide 
analogs and opposite strand base changes, which facilitates the conversion of the 
primary polymerase chain reaction product sequence into a restriction 
endonuclease recognition site in the subsequent secondary polymerase chain 
reaction. 

Next, there is a second polymerase chain reaction phase. This 
phase involves providing a secondary oligonucleotide primer set having a first 
oligonucleotide primer with a target-specific portion and a 5 f upstream secondary 
primer-specific portion and a second oligonucleotide primer with a target-specific 
portion and a 5* upstream secondary primer-specific portion. The secondary 
oligonucleotide primers in a particular set are suitable for hybridization on 
complementary strands of the primary extension products to permit formation of a 
secondary polymerase chain reaction product which contains or creates a 
restriction endonuclease recognition site when amplifying the high abundance 



-28- 



target, but does not contain or create a restriction endonuclease recognition site 
when amplifying the one or more low abundance targets. The primary extension 
products, the secondary oligonucleotide primers, and the polymerase are blended 
to form a secondary polymerase chain reaction mixture. 

The secondary polymerase chain reaction mixture is subjected to 
two or more polymerase chain reaction cycles comprising a denaturation 
treatment, a hybridization treatment, and an extension treatment. The denaturation 
treatment involves separation of hybridized nucleic acid sequences. In the 
hybridization treatment, the secondary oligonucleotide primers hybridize to the 
primary extension products. The extension treatment causes the hybridized 
primary extension products to form secondary extension products complementary 
to the primary extension products. The high abundance secondary extension 
products contain a restriction site but the low abundance secondary extension 
products do not. 

The next phase involves blending a restriction endonuclease with 
the secondary extension products to form an endonuclease digestion reaction 
mixture. The restriction endonuclease is one that recognizes and cleaves the 
restriction endonuclease recognition site within or created when amplifying the 
high abundance target, but not the low abundance target in the secondary 
extension products. The restriction endonuclease digestion selectively destroys 
the high abundance secondary extension products. 

Next, there is a third polymerase chain reaction phase. This 
involves providing a tertiary oligonucleotide primer set having a first tertiary 
primer containing the same sequence as the 5 ! upstream portion of the first 
oligonucleotide primer of the secondary oligonucleotide primer set and a second 
tertiary primer containing the same sequence as the 5' upstream portion of a 
second oligonucleotide primer of the secondary oligonucleotide primer set. The 
set of tertiary oligonucleotide primers may be used to amplify all of the secondary 
extension products. The secondary extension products are blended with the 
tertiary oligonucleotide primer set, and a polymerase to form a tertiary polymerase 
chain reaction mixture. 
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The tertiary polymerase chain reaction mixture is subjected to two 
or more polymerase chain reaction cycles comprising a denaturation treatment, a 
hybridization treatment, and an extension treatment. The denaturation treatment 
causes the hybridized nucleic acid sequences to be separated, while the 
hybridization treatment involves hybridization of the tertiary oligonucleotide 
primers to hybridize to the secondary extension products. During the extension 
treatment, the hybridized tertiary oligonucleotide primers are extended to form 
tertiary extension products complementary to the secondary extension products. 

Next, the tertiary extension products are subjected to a ligase 
detection reaction. This involves providing a plurality of oligonucleotide probe 
sets, each set having a first oligonucleotide probe, having a tertiary extension 
product-specific portion and a detectable reporter label, and a second 
oligonucleotide probe, having a tertiary extension product-specific portion. The 
oligonucleotide probes in a particular set are suitable for ligation together when 
hybridized adjacent to one another on a complementary tertiary extension product- 
specific portion. However, there is a mismatch which interferes with such ligation 
when hybridized to any other nucleotide sequence present in the sample. 

The tertiary extension products, the plurality of oligonucleotide 
probe sets, and a ligase are blended to form a ligase detection reaction mixture. 

The ligase detection reaction mixture is subjected to one or more 
ligase detection reaction cycles having a denaturation treatment, and a 
hybridization treatment. The denaturation treatment involves separation of 
hybridized oligonucleotides from the tertiary extension products. In the 
hybridization treatment, the oligonucleotide probe sets hybridize at adjacent 
positions in a base-specific manner to their respective tertiary extension products, 
if present. As a result, adjacent probes ligate to one another to form a ligation 
product sequence containing the detectable reporter label and the tertiary extension 
product-specific portions connected together. The oligonucleotide probe sets may 
hybridize to nucleotide sequences other than their respective complementary 
tertiary extension products but do not ligate together due to a presence of one or 
more mismatches and individually separate during the denaturation treatment. 
Following the ligase detection reaction cycles, the reporter labels of the ligation 
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product sequences are detected which indicates the presence of one or more low 
abundance target nucleotide sequences in the sample. 

FigureF igures 1-h *A-C are a schematic diagram of the amplification 
and detection of mutant DNA by PCR/RE/LDR according to the method of the 
present invention. The process begins with a sample containing a normal, or wild- 
type sequence in high abundance, and, possibly, a mutant of that sequence, such as 
a cancer gene, in low abundance. Comparing the mutant and wild type sequences 
in Figur e 1 . F igures 1A-C. it is apparent that they differ with respect to only one 
base. In the wild-type sequence, the distinguishing base is a "C," while the mutant 
sequence has a "T" at the corresponding location. 

First, the sample is subjected to a primary PCR step, which creates 
a modified target sequence. In the primary PCR procedure shown in Steps 1 and 2 
of Figure 1, wild-type and mutant DNA undergo denaturation at 94°C, to create 
single-stranded DNA templates. As shown in Step 1 of Figure 1, primers A and 
B, each containing 3' terminal nucleotide analogue are annealed to the single 
stranded DNA templates. Facilitated by a polymerase, the analogue primers 
undergo an extension procedure as shown in Step 2 of Figure 1 . This process of 
denaturing double stranded nucleic acids, hybridizing primers to the resulting 
single stranded nucleic acids, and extending the primers is repeated, in accordance 
with conventional PCR procedures, to produce, in quantity, extension products in 
the form of modified target sequences containing the nucleotide analogue. The 
polymerase chain reaction process is fully described in H. Erlich, et al., "Recent 
Advances in the Polymerase Chain Reaction," Science 252: 1643-50 (1991); M. 
Innis, et al., PCR Protocols: A Guide to Methods and Applications , Academic 
Press: New York (1990); and R. Saiki, et al., "Primer-directed Enzymatic 
Amplification of DNA with a Thermostable DNA Polymerase," Science 239: 487- 
91 (1988), which are hereby incorporated by reference. Further details for use of 
PCR in accordance with the present invention are provided in the Examples 
below. 

As shown in Step 1 of Figure 1, the wild-type and mutant 
sequences have complementary X: W and Y:Z base pairs proximate to the 
distinguishing bases of these sequences. When these sequences are denatured in 
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the primary PCR procedure, primers A and B are configured to hybridize to those 
sequences such that Q bonds to the Y and W bases. In subsequent cycles of the 
primary PCR step, polymerase encounters a Q analog in what is now a template 
strand. The polymerase can "read" the analog as one of several bases and it will 
"write" opposite the Q one of several different bases which in effect are 
"complementary" to the analog. Because different bases can be incorporated at 
such positions, the products are degenerate, ideally having G, A, T and C present 
in the same positions opposite Q analogs in the pool of products. These 
degenerate bases are indicated by "N." As shown in Step 2 of Figure 1, the 
mismatch, formed in both the wild-type and mutant sequence, is signified as base 
"N". 

Nucleotide analogs suitable for the present invention include, but 
are not limited to, the following: Ql 5 l-(2'-deoxy-fl-D-ribofuranosyl)imidazole-4- 
carboxamide; Q2, l-(2'-deoxy-B-D-ribofuranosyl)-3-nitropyrrole; Q5 3 T- 
deoxyinosine; Q6, 6-(2'-deoxy-B-D-ribofuranosyl)-6H,8H-3,4- 
dihydropyrimido[4,5-c] [ 1 ,2]oxazine-7-one; Q7, 2-amino-7-(2 '-deoxy-B-D- 
ribofuranosyl)-6-methoxyaminopurine; Q16, l-(2'-deoxy-B-D-ribofuranosyl)-4- 
iodopyrazole; Q18, l-(2'-deoxy-B-D-ribofuranosyl)pyrrole-3-carboxamide; Q19, 
1 -(2 '-deoxy-B-D-ribofuranosyl)-4-nitropyrazole. 

Next, as shown in Step 3 of Figure 1, a secondary PCR phase is 
carried out. The oligonucleotide primer set of the secondary PCR phase has a first 
oligonucleotide primer with a target-specific portion and a 5 f upstream secondary 
primer-specific portion, and a second oligonucleotide primer with a target-specific 
portion and a 5' upstream secondary primer-specific portion. These are 
represented by primers "C" and "D" in Step 3 of Figure 1 . During the secondary 
PCR phase, the target specific portions of the secondary PCR primers will 
hybridize to complementary strands of the primary extension products. As shown 
in Step 3 of Figure 1 , these hybridized primers are then extended using polymerase 
to form secondary PCR extension products, as shown in Step 3 of Figure 1 . The 
secondary PCR phase is repeated for 2-20 PCR cycles, involving denaturation of 
double stranded nucleic acids, primer hybridization, and primer extension, until 
the primary PCR extension product is sufficiently amplified. 
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Due to the "T" v. "C" difference in the mutant sequence versus the 
wild-type sequence, respectively, as shown in Figur e L F igures 1A-C. the 
secondary extension products can be treated with the appropriate restriction 
endonuclease (RE) that recognizes the RE site in the secondary extension products 
derived from the wild-type sequence. However, the secondary PCR extension 
product formed from the mutant nucleic acid sequence does not contain a 
restriction endonuclease site. In the embodiment of Figur e 1 . F igures 1 A-Cl the 
restriction endonuclease recognition site incorporated in the secondary extension 
products derived from the wild- type sequence is a Taql recognition site, 5' 
— TCGA — 3'. Taql cleaves specifically within this recognition site, between the T 
and the C, in each strand of the double stranded DNA secondary extension 
product. However, other restriction endonuclease sites and their corresponding 
restriction endonucleases could instead be utilized. 

Once the restriction site is incorporated into the high abundance 
target, the appropriate restriction endonuclease is added, under conditions that 
allow the digestion of the restriction site nucleotide sequence. Restriction 
endonucleases, derived from bacteria, are enzymes that cut DNA within a 
nucleotide chain. Each restriction endonuclease recognizes specific short 
oligonucleotides from four to eight residues long in a DNA sequence. Under 
appropriate conditions the RE cleaves the each strand at a phosphodiester bond 
within the recognition site. A restriction endonuclease cuts a pure DNA sample 
into consistently reproducible fragments that can easily be separated by gel 
electrophoresis. Several hundred restriction endonucleases have been identified 
and are commercially available. See Darnell et al., "Manipulating 
Macromolecules," Molecular Cell Biology . Second Edition, New York, New 
York: W. H. Freeman and Company, pp. 189-225 (1990), which is hereby 
incorporated by reference. Any RE can be used in accordance with the method of 
the present invention. The choice of RE to be used is made based upon the 
information available for the DNA in the starting sample. The RE sites that occur 
within a gene are readily available through resources such as GenBank; the 
manufacturer, if the DNA has been obtained commercially, or will have been 
identified as part of the study of the gene, prior to the use of the method of the 
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present invention. Primer design for the PCR phases of the invention is based 
upon the knowledge of the restriction sites in a gene. RE site identification can be 
readily determined by DNA mapping and DNA cloning. Watson et al., "In Vitro 
Mutagenesis/' Recombinant DNA . Second Edition, New York, New York: W. H. 
Freeman and Company, pp. 191-194 (1983), which is hereby incorporated by 
reference. Optimal conditions for complete cleavage by a restriction endonuclease 
are specific. Digestion should be carried out per manufacturer's recommendations 
for a particular RE, or as described in Sambrook et al. 5 Molecular Cloning: A 
Laboratory Manual , Second Edition, Cold Spring Harbor Press, NY (1989), which 
is hereby incorporated by reference. 

After treatment of the secondary extension product with a 
restriction endonuclease, a tertiary PCR process is carried out. As shown in Step 6 
of Figure 1, this involves providing a tertiary oligonucleotide primer set having a 
first tertiary primer that is a universal primer (Ul), containing the same sequence 
as the 5 1 upstream portion of the first oligonucleotide primer of the secondary 
oligonucleotide primer set, and a second tertiary primer containing the same 
sequence as the 5 ! upstream portion of a second oligonucleotide primer of the 
secondary oligonucleotide primer set. The set of tertiary oligonucleotide primers 
are used in amplifying the restriction endonuclease-digested secondary extension 
products. The secondary extension products are blended with the tertiary 
oligonucleotide primer set, and a polymerase to form a tertiary polymerase chain 
reaction mixture. 

The tertiary polymerase chain reaction mixture is subjected to 
polymerase chain reaction conditions involving denaturation of double stranded 
nucleic acids, hybridization of tertiary oligonucleotide primers to the resulting 
single stranded nucleic acids, and extension of the hybridized primer. This 
procedure is repeated through a sufficient number of cycles involving these steps 
in order to produce an appropriate amount of tertiary PCR extension products. 
Since the secondary PCR extension products have been treated with a restriction 
endonuclease, the tertiary PCR extension products derived from the wild-type 
nucleic acid sequence are short as a result of their being cleaved proximate to the 
nucleotide corresponding to the distinguishing base. On the other hand, the 
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tertiary PCR extension products derived from the mutant nucleic acid sequence are 
larger and not cleaved proximate to the nucleotide corresponding to the 
distinguishing base. See Figure 1, Step 6. As a result, the tertiary extension 
products derived from the mutant nucleic acid sequence are readily distinguished 
from those derived from wild-type nucleic acid sequence in the subsequent LDR 
step. 

A second restriction endonuclease digest may also be carried out 
following the tertiary PCR phase, to remove any high abundance targets that may 
have been amplified due to PCR error or incomplete digestion in the first RE step. 

Following the tertiary PCR extension procedure, the resulting 
tertiary extension products are subjected to an LDR procedure according to either 
Step 7 A or Step 7B of Figure 1 . In either of these alternatives, the LDR procedure 
begins by first denaturing the tertiary PCR extension products to produce single- 
stranded DNA (ssDNA). The discrimination probes are distinguished by having a 
different nucleotide on the 3 ' ends and a different number of adenine bases on 
their 5' ends. As a result, ligation products produced from each of the 
discrimination probes are identified by detection of the fluorescent label and 
distinguished from one another on a gel by their differing electrophoretic mobility. 

In Step 7A of Figure 1, the LDR probe set contains 4 alternative 
discrimination probes and a common probe, which can ligate to any of the four 
discrimination probes. The common probe contains a fluorescent tag on its 3 5 
end. The tertiary extension products, these oligonucleotide probes, and a 
thermostable ligase are combined to form a ligase detection reaction mixture and 
subjected to a series of ligase detection reaction cycles. 

The ligase detection reaction is described generally in WO 
90/17239 to Barany et al., "Cloning, Overexpression and Nucleotide Sequence of 
a Thermostable DNA Ligase-encoding Gene," Gene , 109:1-1 1 (1991), and F. 
Barany, "Genetic Disease Detection and DNA Amplification Using Cloned 
Thermostable Ligase," Proc. Natl. Acad. Sci. USA . 88:189-193 (1991), and U.S. 
Patent Nos. 5,494,810, 5,830,71 1 and 6,027,889 to Barany, the disclosures of 
which are hereby incorporated by reference. In accordance with the present 
invention, the ligase detection reaction can use 2 sets of complementary 
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oligonucleotides. This is known as the ligase chain reaction, which is described in 
the immediately preceding references, which are hereby incorporated by reference. 
Alternatively, the ligase detection reaction can involve a single cycle that is known 
as the oligonucleotide ligation assay. See Landegren, et aL, "A Ligase-Mediated 
Gene Detection Technique," Science 241 : 1077-80 (1 988); Landegren, et aL, 
"DNA Diagnostics -- Molecular Techniques and Automation," Science 242:229- 
37 (1988); and U.S. Patent No. 4,988,617 to Landegren, et al., which are hereby 
incorporated by reference. 

During ligase detection reaction phases, the denaturation treatment 
is carried out at a temperature of 80-105° C, while hybridization takes place at 50- 
85° C. Each cycle comprises a denaturation treatment and a thermal hybridization 
treatment which in total is from about one to five minutes long. Typically, the 
ligation detection reaction involves repeatedly denaturing and hybridizing for 2 to 
50 cycles. The total time for the ligase detection reaction phase is 1 to 250 
minutes. 

The preferred thermostable ligase is that derived from Thermus 
aquaticus. This enzyme can be isolated from that organism. M. Takahashi, et al., 
"Thermophilic DNA Ligase," J. Biol. Chem. 259:10041-47 (1984), which is 
hereby incorporated by reference. Alternatively, it can be prepared recombinantly. 
Procedures for such isolation as well as the recombinant production of Thermus 
aquaticus ligase (as well as Thermus themophilus ligase) are disclosed in WO 
90/17239 to Barany, et al., F. Barany, et al., "Cloning, Overexpression and 
Nucleotide Sequence of a Thermostable DNA-Ligase Encoding Gene," Gene 
109:1-1 1 (1991), and U.S. Patent Nos. 5,494,810 and 5,830,71 1 to Barany, which 
are hereby incorporated by reference. These references contain complete sequence 
information for this ligase as well as the encoding DNA. Other suitable ligases 
include E. coli ligase, T4 ligase, and Pyrococcus ligase. 

The ligation detection reaction mixture may include a carrier DNA, 
such as salmon sperm DNA. 

The hybridization step in the ligase detection reaction, which is 
preferably a thermal hybridization treatment, discriminates between nucleotide 
sequences based on a distinguishing nucleotide at the ligation junctions. The 
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difference between the target nucleotide sequences can be, for example, a single 
nucleic acid base difference, a nucleic acid deletion, a nucleic acid insertion, or 
rearrangement. Such sequence differences involving more than one base can also 
be detected. 

The oligonucleotide probe sets, as noted above, may have a reporter 
label suitable for detection. Useful labels include chromophores, fluorescent 
moieties, enzymes, antigens, heavy metals, magnetic probes, dyes, phosphorescent 
groups, radioactive materials, chemiluminescent moieties, and electrochemical 
detecting moieties. Also appropriate would be the use of molecular weight tags 
for discrimination by mass spectroscopy, such as a matrix assisted laser desorption 
ionization - time of flight mass spectroscopy (MALDI-TOF) array system. 

The oligonucleotide probe sets can be in the form of 
ribonucleotides, deoxynucleotides, modified ribonucleotides, modified 
deoxyribonucleotides, peptide nucleotide analogues, modified peptide nucleotide 
analogues, modified phosphate-sugar-backbone oligonucleotides, nucleotide 
analogs, and mixtures thereof. 

The oligonucleotides of the oligonucleotide probe sets each have a 
hybridization or melting temperature (i.e. T m ) of 66-70°C. These oligonucleotides 

are 20-28 nucleotides long. 

As shown in Step 7A of Figure 1, the ligation products derived 
from the mutant nucleic acid sequence are formed from the discrimination probe 
having 2 adenine bases on its 5' end. The other discrimination probes will not be 
ligated to the common probe as a result of the mutant nucleic acid sequence being 
present. 

On the other hand, for the most part, no ligation products using the 
above-described probe sets, will be formed from the wild-type sequence. This 
occurs because the endonuclease digestion causes tertiary extension products 
derived from the wild-type nucleic acid sequence to be cleaved, and, as a result, 
both the common and discrimination probes of that probe set will not hybridize to 
such tertiary extension product in a manner permitting them to be ligated together. 
However, to the extent the endonuclease digestion of the secondary extension 
products derived from the wild-type sequence is incomplete, a small residual 
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amount of uncleaved tertiary extension product derived from the wild-type 
sequence may be present in the ligase detection reaction mixture. This will cause 
ligation products to be produced from the discrimination probe having no adenines 
at its 5' end. 

Figure 2 shows the gel electrophoresis detection of products formed 
from the ligase detection reaction procedure of Figure 1, Step 7A. As shown in 
Figure 2, unligated probes have great electrophoretic mobility and form a band at 
the bottom of the gel. Ligation products derived from any small amount of tertiary 
extension products derived from the wild-type sequence have the next greatest 
electrophoretic mobility and form a band at an intermediate location on the gel. 
Ligation products derived from the mutant sequence have less electrophoretic 
mobility than the wild-type ligation products, and form a thick band slightly above 
that of the wild-type ligation products. If no mutant sequence is present in the 
sample, no such gel band will be formed. The top two bands are trace ligation 
products with low electrophoretic mobility. These high molecular weight bands 
are artifacts, presumably arising from polymerase error and template 
decomposition during PCR. The composition of the PCR buffer can strongly 
affect the fraction of these PCR error products present in the sample after 
amplification. 

The use of capillary and gel electrophoresis to detect DNA products 
based on differences in their electrophoretic mobility is well known. See e.g., 
Grossman, et-al., "High-density Multiplex Detection of Nucleic Acid Sequences: 
Oligonucleotide Ligation Assay and Sequence-coded Separation," Nucl. Acids 
Res . 22(21): 4527-34 (1994), which is hereby incorporated by reference. 

As an alternative to the ligase detection reaction procedure shown 
in Figure 1 , Step 7B, the discrimination probes can be provided with different 
address-specific portions (i.e. ZIP1, ZIP2, ZIP3, and ZIP4) at their 5' ends. This 
enables the ligation products to be detected on an addressable array having 
different capture oligonucleotide probes which are complementary to the address- 
specific portions on the discrimination probes. As a result, each ligation product 
is directed to a different address of a DNA microarray during a hybridization 
procedure subsequent to the ligase detection reaction phase. The hybridized 



-38- 



ligation products all have the same label but are distinguished from one another by 
the location on which the ligation products are immobilized. Alternatively, 
different labels can be on the discrimination base, while the address-specific 
portion is on the common probe. In this embodiment, different ligation products 
will be immobilized on the array at locations with the same capture probe; 
however, the different ligation products will be distinguished by different labels. 

Figure 3 shows the detection of ligation products resulting from the 
process of Figure 1 , Step 7B using an addressable array. In this case, as shown in 
Figure 1 , Step 7B, the common probe has a label and the discrimination probes 
have the address-specific portion. In Figure 3, the ligation products are caused to 
contact the addressable array under conditions effective to permit address- specific 
portions of the ligation products to hybridize to capture oligonucleotides on the 
array. As shown in Figure 3, Steps 2 and 3, discrimination probes (e.g., probes 
having address-specific portions ZIP4 (not shown) and ZIP3 (shown)) which do 
not form ligation products are immobilized on the array but are not detected due to 
the absence of a label. Unligated common probes do not hybridize to the array 
and are subsequently washed away (e.g., at 65°C-80°C and low salt) so they 
produce no signal. Any small amount of ligation products derived from the wild- 
type nucleic acid sequence (due to incomplete upstream endonuclease digestion) 
are immobilized (together with any of the corresponding unligated discrimination 
probe) at the capture oligonucleotide complementary to address-specific portion 
ZIP1 . Similarly, any ligation products derived from the mutant sequence are 
immobilized (together with any of the corresponding unligated probe) at the 
capture oligonucleotide complementary to address-specific portion ZEP2. The 
presence of mutant sequence and/or wild-type sequence is detected in this 
embodiment by the existence of a fluorescent signal at respective different 
locations on the array. Heterozygosity is indicated by equal signals at the capture 
oligonucleotides complementary to address-specific portions ZIP 1 andZIP2. The 
signals may be quantified using a fluorescent imager. This format uses a unique 
address for each allele and may be preferred for achieving very accurate detection 
of low levels of signal (30 to 100 attomoles of LDR product). 



-39- 

The use of a solid support with an array of capture oligonucleotides 
is fully disclosed in pending provisional U.S. Patent Application Serial No. 
60/01 1,359, which is hereby incorporated by reference. This method involves 
providing a solid support having an array of positions each suitable for attachment 
of an oligonucleotide. A linker or support (preferably non-hydrolyzable), suitable 
for coupling an oligonucleotide to the solid support at each of the array positions, 
is attached to the solid support. An array of oligonucleotides on a solid support is 
formed by a series of cycles of activating selected array positions for attachment of 
either single or a multiple nucleotides at the activated array positions. 

When using such arrays, the oligonucleotide probes used in the 
above-described coupled PCR and LDR phases, respectively, have an addressable 
array-specific portion. After the PCR and LDR phases are completed, the 
addressable array-specific portions for the products of such processes remain 
single stranded and are caused to hybridize to the capture oligonucleotides during 
a capture phase. C. Newton, et al., "The Production of PCR Products With 5' 
Single-Stranded Tails Using Primers That Incorporate Novel Phosphoramidite 
Intermediates," Nucl. Acids Res. 2 1 (5): 1 155-62 (1993), which is hereby 
incorporated by reference. 

During the capture phase of the process, the mixture is contacted 
with the solid support at a temperature of 45-90° C and for a time period of up to 
60 minutes. Adding cations, volume exclusion or chaotropic agents may 
accelerate hybridizations. When an array consists of dozens to hundreds of 
addresses, it is important that the correct ligation product sequences have an 
opportunity to hybridize to the appropriate address. This may be achieved by the 
thermal motion of oligonucleotides at the high temperatures used, by mechanical 
movement of the fluid in contact with the array surface, or by moving the 
oligonucleotides across the array by electric fields. After hybridization, the array 
is washed sequentially with a low stringency wash buffer and then a high 
stringency wash buffer. 

It is important to select capture oligonucleotides and addressable 
nucleotide sequences which will hybridize in a stable fashion. This requires that 
the oligonucleotide sets and the capture oligonucleotides be configured so that the 
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oligomicleotide sets hybridize to the target nucleotide sequences at a temperature 
less than that which the capture oligonucleotides hybridize to the addressable 
array-specific portions. Unless the oligonucleotides are designed in this fashion, 
false positive signals may result due to capture of adjacent unreacted 
oligonucleotides from the same oligonucleotide set which are hybridized to the 
target. 

The capture oligonucleotides can be in the form of ribonucleotides, 
deoxyribonucleotides, modified ribonucleotides, modified deoxyribonucleotides, 
peptide nucleotide analogues, modified peptide nucleotide analogues, modified 
phosphate-sugar backbone oligonucleotides, nucleotide analogues, and mixtures 
thereof. 

It may be desirable to destroy chemically or enzymatically 
unconverted all unligated LDR oligonucleotide probes that contain addressable 
nucleotide array-specific portions prior to capture of the ligation products on a 
DNA array. Such unconverted probes will otherwise compete with ligation 
products for binding at the addresses on the array of the solid support which 
contain complementary sequences. Destruction can be accomplished by utilizing 
an exonuclease, such as exonuclease III (L-H Guo and R. Wu, Methods in 
Enzymology 100:60-96 (1985), which is hereby incorporated by reference) in 
combination with LDR probes that are blocked at the ends and not involved with 
ligation of probes to one another. The blocking moiety could be a reporter group 
or a phosphorothioate group. T.T. Nikiforow, et al., "The Use of 
Phosphorothioate Primers and Exonuclease Hydrolysis for the Preparation of 
Single-stranded PCR Products and their Detection by Solid-phase Hybridization," 
PCR Methods and Applications , 3:p.285-291 (1994), which is hereby incorporated 
by reference. After the LDR process, unligated probes are selectively destroyed by 
incubation of the reaction mixture with the exonuclease. The ligated probes are 
protected due to the elimination of free 3' ends which are required for initiation of 
the exonuclease reaction. This approach results in an increase in the signal-to- 
noise ratio, especially where the LDR reaction forms only a small amount of 
product. Since unligated oligonucleotides compete for capture by the capture 
oligonucleotide, such competition with the ligated oligonucleotides lowers the 
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signal. An additional advantage of this approach is that unhybridized label- 
containing sequences are degraded and, therefore, are less able to cause a target- 
independent background signal, because they can be removed more easily from the 
DNA array by washing. 

A wide variety of infectious diseases can be detected by the method 
of the present invention. Typically, these are caused by bacterial, viral, parasite, 
and fungal infectious agents. The resistance of various infectious agents to drugs 
can also be determined using the present invention. 

Bacterial infectious agents which can be detected by the present 
invention include Escherichia coli, Salmonella, Shigella, Klebsiella, 
Pseudomonas, Listeria monocytogenes, Mycobacterium tuberculosis, 
Mycobacterium avium-intracellulare, Yersinia, Francisella, Pasteurella, Brucella, 
Clostridia, Bordetella pertussis, Bacteroides, Staphylococcus aureus, 
Streptococcus pneumonia, B-Hemolytic strep., Corynebacteria, Legionella, 
Mycoplasma, Ureaplasma, Chlamydia, Neisseria gonorrhea, Neisseria 
meningitides, Hemophilus influenza, Enterococcus faecalis, Proteus vulgaris, 
Proteus mirabilis, Helicobacter pylori, Treponema palladium, Borrelia 
burgdorferi, Borrelia recurrentis, Rickettsial pathogens, Nocardia, and 
Actinomycetes. 

Fungal infectious agents, which can be detected by the present 
invention, include Cryptococcus neoformans, Blastomyces dermatitidis, 
Histoplasma capsulatum, Coccidioides immitis, Paracoccidioides brasiliensis, 
Candida albicans, Aspergillus fumigatus, Phycomycetes (Rhizopus), Sporothrix 
schenckii, Chromomycosis, and Maduromycosis . 

Viral infectious agents, which can be detected by the present 
invention, include human immunodeficiency virus, human T-cell 
lymphocytotrophic virus, hepatitis viruses (e.g., Hepatitis B virus and Hepatitis C 
virus), Epstein-Barr virus, cytomegalovirus, human papillomaviruses, orthomyxo 
viruses, paramyxo viruses, adenoviruses, corona viruses, rhabdo viruses, 
polioviruses, toga viruses, bunya viruses, arena viruses, rubella viruses, and reo 
viruses. 
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Parasitic agents which can be detected by the present invention 
include Plasmodium falciparum, Plasmodium malaria, Plasmodium vivax, 
Plasmodium ovale, Onchoverva volvulus, Leishmania, Trypanosoma spp. f 
Schistosoma spp., Entamoeba histolytica, Cryptosporidium, Giardia spp., 
Trichomonas spp., Balantidium coli, Wuchereria bancrofti, Toxoplasma spp., 
Enterobius vermicularis, Ascaris lumbricoides, Trichuris trichiura, Dracunculus 
medinesis, trematodes, Diphyllobothrium latum, Taenia spp. t Pneumocystis 
carinii, and Necator americanus. 

The present invention is also useful for detection of drug resistance 
by infectious agents. For example, vancomycin-resistant Enterococcus faecium, 
methicillin-resistant Staphylococcus aureus, penicillin-resistant Streptococcus 
pneumoniae, multi-drug resistant Mycobacterium tuberculosis, and AZT-resistant 
human immunodeficiency virus can all be identified with the present invention. 

Genetic diseases can also be detected by the method of the present 
invention. This can be carried out by prenatal or post-natal screening for 
chromosomal and genetic aberrations or for genetic diseases. Examples of 
detectable genetic diseases include: 21 hydroxylase deficiency, cystic fibrosis, 
Fragile X Syndrome, Turner Syndrome, Duchenne Muscular Dystrophy, Down 
Syndrome or other trisomies, heart disease, single gene diseases, HLA typing, 
phenylketonuria, sickle cell anemia, Tay-Sachs Disease, thalassemia, Klinefelter 
Syndrome, Huntington Disease, autoimmune diseases, lipidosis, obesity defects, 
hemophilia, inborn errors of metabolism, and diabetes. 

Cancers that can be detected by the method of the present invention 
generally involve oncogenes, tumor suppressor genes, or genes involved in DNA 
amplification, replication, recombination, or repair. Examples of these include: 
BRCA1 gene, p53 gene, APC gene, Her2/Neu amplification, Bcr/Abl, K-ras gene, 
and human papillomavirus Types 16 and 18. Various aspects of the present 
invention can be used to identify amplifications, large deletions as well as point 
mutations and small deletions/insertions of the above genes in the following 
common human cancers: leukemia, colon cancer, breast cancer, lung cancer, 
prostate cancer, brain tumors, central nervous system tumors, bladder tumors, 
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melanomas, liver cancer, osteosarcoma and other bone cancers, testicular and 
ovarian carcinomas, head and neck tumors, and cervical neoplasms. 

In the area of environmental monitoring, the present invention can 
be used for detection, identification, and monitoring of pathogenic and indigenous 
microorganisms in natural and engineered ecosystems and microcosms such as in 
municipal waste water purification systems and water reservoirs or in polluted 
areas undergoing bioremediation. It is also possible to detect plasmids containing 
genes that can metabolize xenobiotics, to monitor specific target microorganisms 
in population dynamic studies, or either to detect, identify, or monitor genetically 
modified microorganisms in the environment and in industrial plants. 

In the food and feed industry, the present invention has a wide 
variety of applications. For example, it can be used for identification and 
characterization of production organisms such as yeast for production of beer, 
wine, cheese, yogurt, bread, etc. Another area of use is with regard to quality 
control and certification of products and processes (e.g., livestock, pasteurization, 
and meat processing) for contaminants. Other uses include the characterization of 
plants, bulbs, and seeds for breeding purposes, identification of the presence of 
plant-specific pathogens, and detection and identification of veterinary infections. 

The present invention is useful in detecting the presence of one or 
more of a low abundance sequence, that differs from a high abundance sequence 
by one or more single-base changes, insertions, deletions, or translocations when 
the low abundance sequence is present in the same sample in less than a 1 : 1,000 
molar ratio, preferably, in less than a 1 : 1 0,000 molar ratio, and, most preferably, in 
less than a 1 : 100,000 molar ratio, to the amount of the high abundance sequence. 

Another aspect of the present invention is its capability to quantify 
the amount of target nucleotide sequence in a sample. This can be achieved by 
establishing an internal standard (i.e. where the standard establishing material is 
amplified and detected with the sample). 

For quantification using an internal standard, a known amount of 
one or more marker target nucleotide sequences is added to the sample. In 
addition, a plurality of marker-specific oligonucleotide probe sets are added along 
with the ligase, the previously discussed oligonucleotide probe sets, and the 
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tertiary extension products to a mixture. The marker-specific oligonucleotide 
probe sets have (1) a first oligonucleotide probe with a target-specific portion 
complementary to the marker target nucleotide sequence, and (2) a second 
oligonucleotide probe with a target-specific portion complementary to the marker 
target nucleotide sequence and a detectable reporter label. The oligonucleotide 
probes in a particular marker-specific oligonucleotide set are suitable for ligation 
together when hybridized adjacent to one another on a corresponding marker 
target nucleotide sequence. However, there is a mismatch which interferes with 
such ligation when hybridized to any other nucleotide sequence present in the 
sample or added marker sequences. The presence of ligation product sequences is 
identified by detection of reporter labels. The amount of target nucleotide 
sequences in the sample is then determined by comparing the amount of ligation 
product sequence generated from known amounts of marker target nucleotide 
sequences with the amount of other ligation product sequences. 

The present invention also relates to a kit for identifying one or 
more low abundance sequences differing by one or more single-base changes, 
insertions, or deletions, from a high abundance sequence in a plurality of target 
nucleotide sequences. This kit provides a primary oligonucleotide primer set, a 
secondary oligonucleotide primer set, a tertiary oligonucleotide primer set, and a 
plurality of oligonucleotide probe sets. 

The primary oligonucleotide primer set provided in the kit has (a) a 
first oligonucleotide primer containing a target-specific portion, and (b) a second 
oligonucleotide primer containing a target-specific portion. The primary 
oligonucleotide primers are suitable for hybridization on complementary strands 
of a corresponding high and low abundance target nucleotide sequences to permit 
formation of a primary extension product. However, the primers have a mismatch 
which interferes with formation of such a polymerase chain reaction product when 
hybridized to any other nucleotide sequence present in the sample. 

The secondary oligonucleotide primer set provided in the kit has (a) 
a first oligonucleotide primer, containing a target-specific portion and a 5' 
upstream secondary primer-specific portion, and (b) a second oligonucleotide 
primer, containing a target-specific portion and a 5* upstream secondary primer- 
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specific portion. The secondary oligonucleotide primers are suitable for 
hybridization on complementary strands of the primary extension products to 
permit formation of a secondary extension product which contains or creates a 
restriction endonuclease recognition site when amplifying the high abundance 
target, but does not contain or create a restriction endonuclease recognition site 
when amplifying the one or more low abundance targets. 

The tertiary oligonucleotide primer set provided in the kit have (a) 
a first tertiary primer containing the same sequence as the 5 ! upstream portion of 
the first oligonucleotide primer of the secondary oligonucleotide primer set, and 
(b) a second tertiary primer containing the same sequence as the 5 f upstream 
portion of a second oligonucleotide primer of the secondary oligonucleotide 
primer set. The set of tertiary oligonucleotide primers may be used to amplify all 
of the secondary extension products. 

The kit also provides a plurality of oligonucleotide probe sets. 
Each set has (a) a first oligonucleotide probe, containing a tertiary extension 
product-specific portion and a detectable reporter label, and (b) a second 
oligonucleotide probe, containing a tertiary extension product-specific portion. 
The oligonucleotide probes in a particular set are suitable for ligation together 
when hybridized adjacent to one another on a complementary tertiary extension 
product-specific portion. However, the probes have a mismatch which interferes 
with such ligation when hybridized to any other nucleotide sequence present in the 
sample. 

The kit may also contain a polymerase, an endonuclease restriction 
enzyme, and/or a ligase, as described above. 
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EXAMPLES 

Optimizing Analog Additions under PCR Conditions 

Example 1 - Oligonucleotide Synthesis for Analog Efficiency 

Oligonucleotides were synthesized at the 0.2 ^imole scale by 
cyanoethyl phosphoramidite chemistry on an Applied Biosystems 394 DNA 
synthesizer. Standard 500A CPG columns and reagents (Applied Biosystems) 
were used with the following exceptions: oligonucleotides 50 bases in length 
were synthesized using wide-pore 1000A CPG columns (Applied Biosystems); 
oligonucleotides with fluorescent dye FAM at the 5' terminus were synthesized 
using FAM phosphoramidite (Applied Biosystems) with a 15 minute coupling 
step; oligonucleotides with 5' phosphate were synthesized using phosphorylation 
reagent (Glen Research), with a 15 minute coupling step; oligonucleotides with 3' 
blocking group were synthesized using Y-Spacer CPG columns (Glen Research). 
Oligonucleotides with the 3' nucleotide analogs 2'-deoxyinosine Qs), 6-(2'-deoxy- 
p-D-ribofuranosyl)-6H, 8H-3,4-dihydropyrimido[4 ? 5-cl[l,2]oxazine-7-one (Q&), 
and 2-amino-7-(2'-deoxy-P-D-ribofuranosyl)-6-methoxyaminopurine Q7) were 
synthesized using 2'-deoxyinosine-CPG, dP-CPG and dK-CPG, respectively 
(Glen Research), as shown in Figure 5. The oligonucleotide primers containing 
Qi 3 Q2 and Qis at the Y-position were synthesized from Qi, Q2, and Qis derived- 
CPG synthesized from Qi (Johnson et al., "The Synthesis and Stability of 
Oligodeoxyribonucleotides Containing the Deoxyadenosine Mimic l-(2 f -Deoxy- 
Beta-D-Ribofuranosyl)Imidazole-4-Carboxamide," Nucleic Acid Res. , 25:559-67 
(1997)), Q2 (Bergstrom et ah, Journal of the American Chemical Society , 
1 17:1201-9 (1995)) and Qis (Zhang et al., "Exploratory Studies on Azole 
Carboxamides as Nucleobase Analogs: Thermal Denaturation Studies on 
Oligodeoxyribonucleotide Duplexes Containing Pyrrole-3-Carboxamide," Nucleic 
Acids Res. . 26:2208-15 (1998)) by the method of Pon et al. (Pon et al., 
"Derivatization of Controlled Pore Glass Beads for Solid Phase Oligonucleotide 
Synthesis," BioTechniques . 6:768-75 (1988)). 
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Example 2 - PCR Polymerases and Buffers 

The DNA polymerases used were AmpliTizg, AmpliJa^ - Stoeffel 
Fragment, Ampli7a^ - Fluorescent Sequencing (Applied Biosystems), Vent and 
Vent(exo ) (New England Biolabs), and Expand polymerase mix (Taq and Pfu 
polymerase mixture, in Expand High Fidelity kit, Boehringer Mannheim). The 
commercially available PCR buffers used were supplied in the AmpliTa^ and 
Expand High Fidelity kits. An alternative buffer CiNF, buffer G(f), is described 
elsewhere (Day et al. 5 Nucleic Acids Res , (1999)). Briefly, CiNF reactions contain 
20 mM citrate pH 7.6, 200 |Lig/ml bovine serum albumin, 2.5 mM MgC12, 200 jiM 
dNTP (each) and either 16 mM (NH^SCU, or 50 mM potassium acetate, 10% 
formamide, primers, and template DNA. All PCR and LDR reactions described 
below were performed under paraffin oil. 

Example 3 - Mismatch Extension Efficiency 

Primers containing natural bases and nucleotide analogs were used 
in PCR to compare and measure the efficiency of product formation from 
synthetic duplex p53 exon 7 templates having Mspl (CCGG), Taql (TCGA), Hhal 
(GCGC), or Tail (ACGT) sites at the Mspl position containing codon 248. The 
primers hybridized to wild-type sequence on either side of the Mspl site with the 
3' ends of the primers extending one base into the site on each side, see Figure 6A. 
Eight different analogs and the four natural bases were tested in parallel reactions 
on each of the four synthetic templates. PCR was performed using Stoeffel Taq or 
7b<7-Fluorescent Sequencing polymerases with the buffer supplied by 
manufacturer for each polymerase. 10 pmol of each primer and 20 finol of duplex 
template were used, and 0.2 mM each dNTP and 4 mM MgCh. Parallel reactions 
underwent 10, 20, 30, 40, and 50 PCR cycles of 94°C for 15 sec, 65°C for 1 min. 
Efficiency and yield were determined from samples run on 6% agarose gels and 
stained with ethidium bromide. 

Example 4 - Mismatch Conversion Product Sequencing 

Products most efficiently amplified by each analog were diluted 
1000-fold in water. The diluted DNA products were reamplified for 20 cycles of 



-48- 



94°C for 15 sec, 65°C for 2 min. using the same polymerase and buffer as in the 
previous PCR, but with the addition of 10 pmol of "zipcode" containing primers 
p53zip248 (SEP. ID. No. 5} and p53zip£4fe 248R TSEO. ID. No. 6Y shown in 
Figure 6A. Zipcode sequences are oligonucleotides with no known sequence 
similarity to DNA sequences in any organism. Amplification with zipcode 
primers is intended to specifically amplify the zipcode containing products of the 
previous PCR; i.e.,, only converted DNA (containing zipcodes) and not the nearly 
identical unconverted DNA (lacking zipcodes) will be amplified. Conversion 
products were run on 8% agarose gels and bands of the expected size excised. 
DNA was extracted from the gel slices by centrifiigation in a 235C 
microcentrifuge (Fisher) for 30 min through 0.45 jam HVLP filter (Millipore). 
The conversion product was dried and resuspended in ABI Dye Terminator Cycle 
Sequencing reaction mix with one of the zipcode primers according to kit 
instructions (Applied Biosystems). An equal volume (3 jil each) of sequencing 
reaction was combined with dye mix consisting of 83% formamide (Eastman), 4 
mM EDTA and 8 mg/ml Blue Dextran (Sigma). Samples were electrophoresed on 
a 7M urea, 10% acrylamide gel (19:1 bis, 0.6x TBE in gel and running buffer) in 
an ABI 373 DNA Sequencer. Data were analyzed using ABI 373A DNA 
Sequencer Data Analysis software version 1 .2.0. 

Example 5 - Conversion Product Identification, With and Without A 

Preconversion Step 

Conversion fidelity was tested using nine different synthetic templates, with and 
without preconversion using three primers containing Q5, Q6, and Q7 (see Example 
1). Preconversion PCR was performed with 3' analog primers prior to adding the 
desired natural base primers, in an effort to avoid mismatch primer extension. The 
50-basepair duplex DNA templates contained the wild-type p53 sequence 
surrounding codon 248, shown in Figure 6B, except for the bases corresponding to 
the Mspl site (CCGG). The following sequences were substituted at the Mspl 
position: 1) CCGG rwild-tvpe. SKO. TP. No. 2\ 2) CTOG fSEO. TP. No. 17^. 
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3) CGGG (SEP. ID. No. 18V 4) CAGG fSHO. TP, No. 19V 5) TCG AfSEO. IP. 
No. 20V 6) GCGC fSEO. TP. No. 21V 7) ACGT TSEP. TP. No. 22V 
81 A€GT CAGT (SEP. IP. No. 23V and g> OCGC fSEO. IP. No. 24V A pre- 
secondary PCR reaction ("preconversion") was performed with hot start using 50 
finol/nl p53-248Q N (SEP. IP. No. 31 and p51-248P*R fSEP. IP. No. 41 primers, 
and Vent (exo-) in CiNF buffer, buffer G(f), and 10 fmol/|il of duplex template. 
Preconversion used 2 PCR cycles of 94°C 15 sec, 55°C 1 min, 60°C 1 min. 
Product was stored at 4°C. Conversion reactions were started with 1 jul of 
preconversion reaction containing the same polymerase and buffer, but no 
additional template. Each reaction required 1 0 pmol of each primer, using one of 
the four pairs p53zip248N and p53zip248NR (N = C, T^Gk SEO. TP. No. 25: T. 
SEP. IP. N o. 9: G. SEP . IP. No. 26: or A. SEP. TP. No. 27V Parallel conversion 
reactions with no preconversion were initiated with a hot start by adding 1 0 finol 
of synthetic duplex template instead of preconversion reaction aliquot. PCR 
cycles were as follows: 5 cycles of 94°C 15 sec, 55°C + 1° per cycle 1 min, 60°C 1 
min; then 20 cycles of 94°C 15 sec, and 60°C 2 min. A final extension was 
performed at 60°C for 5 min. Polymerase was inactivated by freezing and thawing 
twice. Products were diluted lOx in water and reamplified by adding 1 |ul to 20 \xl 
of Expand polymerase and buffer mix. PCR was performed for 20 cycles (30 
cycles for low yield reactions) of 94°C 15 sec, 65°C 2 min using 12 pmol of 
zipcode primers Ztop (SEP. IP. No. 7^ and Zho t (SEP. IP. No. 8"> (Figure 6). 
LPR was performed as described below to identify the conversion products 
generated. 

Example 6 - Ligase Detection Reaction 

Ligase detection reactions were performed in standard LPR buffer 
(25 mM Tris pH 7.6, 12 mM MgCl2, 65 |ng/ml bovine serum albumin, 100 mM 
KC1, and 10 mM PTT). Each 20 jal reaction contained approximately 500 fmol of 
dsPNA (1 jul of sample), 500 fmol of each discriminating probe fSEP. IP. 
Nos. 1 1-14^. and 750 finol of common probe (SEP. IP. No. 15V as shown in 
Figure 6C. Sets of discriminating and common probes were synthesized to 
perform LPR on the expected conversion products and varied at the bases (Bi) 
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corresponding to the Mspl position sense strand (B1B2B3B4 = CCGG for wild- 
type). The discrimination probes had wild-type sequence and terminate in -BiB2(- 
OH 3'). The discrimination probes were synthesized as a set of four probes each 
with C, T, G and A in turn at B 2 . The common LDR probes had (5'P0 4 -)B 3 B4- 
followed by wild-type sequence, and hybridized to the template with its 5* base 
adjacent to the 3' base of a discrimination probe. Discrimination probes varied the 
3' terminal base to identify error products at B2 of the Mspl position. For 
simplicity, only B2 was monitored. LDR probes matched the expected conversion 
products; for example, conversion of -CCGG- template to -ACGT- required 
discrimination probes ending in -AC, -AT, -AG, and -AA, and a common probe 
with 5' pGT-. Discrimination probes had 5' tails of different length and a FAM 
label for fluorescence detection. The tail length allowed physical separation of 
different LDR products on an acrylamide gel, and thus identification of the LDR 
products. 

LDR reactions were preincubated for 1.5 min at 94°C prior to the 
addition of 5 nmol Tth ligase, followed by 10 LDR cycles of 94°C 15 sec, 65°C 2 
min, and a final hold briefly at 94°C. Reactions were cold quenched and stored at 
-70°C. The LDR products were separated on 10% acrylamide gels containing 7M 
urea, with 0.6x TBE (lx TBE: 90 mM Tris base, 90 mM Borate, 2 mM EDTA) in 
the gel and running buffer. Data were collected using an ABI 373 DNA sequencer 
with Genescan 672 software. 

Example 7 - Image Processing 

Gel pictures were produced by the ABI 672 Analysis software. 
Dye specific images were opened in Adobe Photoshop 3.0, cropped, resized, and 
converted to grayscale. The grayscale images were opened in NIH Image 1 .59, 
inverted, and ID vertical background was subtracted. The background-subtracted 
images were reinverted and rendered in pseudocolor by Photoshop to make 
intensity differences easier to compare. Except for color replacement, only linear 
image processing was performed to preserve relative intensities. 

Initial experiments were designed to determine the efficiency of 
generating PCR products when using probes containing 3 ' terminal nucleotide 
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analogs (see above Examples). Eight different analogs were designed to pair with 
more than one of the four natural bases in order to convert one base to another 
base at a specific position in a sequence. Primer pairs containing either a 
nucleotide analog or one of the four natural bases at their V ends were used to 
amplify four different templates (Figure 6A). Each nucleotide analog and natural 
base was mispaired (or paired) in turn with all four natural bases on the opposite 
strand, and amplification was attempted with either Taq Stoeffel Fragment or Tag 
Fluorescent Sequencing polymerases. The relative amplification efficiency was 
determined by the number of cycles required to generate visible product on an 
ethidium bromide stained agarose gel, as shown in Table 1 . The 50-base pair 
synthetic duplex DNA templates containing p53 sequence spanning codon 248 are 
distinguished by the four bases replacing the Mspl site, which are shown. 
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Table 1 



TCGA template 



CCGG 
template 



GCGC 
template 



ACGT 
template 



primer 
3' base 



reads A 
write (eff.) 



reads G 
write (eff.) 



reads C 
write (eff.) 



reads T 
write (eff.) 



T 
C 
G 
A 

Qi 
Q2 

Qs 

Q6 
Q7 
Ql6 
Ql8 
Ql9 



A,T 



A.G 



A.T 



(+++) 
(++) 
(++) 
(+) 
(±) 

(±)' 
(++) 

(+++) 
(+) 

(+)' 
(+)' 

(++)' 



A 
G 



(++) 
(+++) 
(++) 
(+) 
(±)' 

(±)' 
(++) 
(+++) 
(+) 

(-) 
(±)' 

(-) 



C 
T 



(++) 
(++) 
(+++) 
(+++) 
(-) 

(-) 
(+++) 

(++) 
(+++) 

(-) 
(±)' 

J*2_ 



T 

C 



T^A 



(++) 

(++) 
(+++) 

(+++) 

(+++) 
l 

(++)' 
(+++) 

(++) 
(+++) 

(-) 

(+++) 



1 



Low product yield. 



Table 1. Extension efficiency and conversion with 3' natural base and nucleotide analog primers. 
Four different templates were used to test primer extension from a 3' base or analog paired in turn 
with A, G, C and T. Relative efficiency was determined by the number of cycles required to 
generate visible product with Tag Stoeffel Fragment polymerase: 10 cycles, (++) 20 cycles, (+) 30 
cycles, (±) 40 to 50 cycles, (-) no product. Two of the natural base mismatch primer products were 
sequenced. Generally, the most efficiently amplified template for each analog was reamplified with 
truncated primers and sequenced to determine which bases are written opposite each analog- In 
one case (Qi) a lower efficiency extension product with higher yield was selected for sequencing. 
Mixed base writing preference for some analogs is indicated, with most frequent product listed 
first. 



Both Taq Stoeffel Fragment and Tag Fluorescent Sequencing 
polymerases generated comparable amounts of product. Perfectly matched natural 
base primers generated visible product after 10 cycles, however some analog 
primers generated no product after 50 cycles. The analogs that did amplify with 
high efficiency were those that were best able to "read" the opposite strand 
sequence, see Figure 4. 

One product for each analog (as well as the natural base controls) 
were reamplified and sequenced to determine polymerase preference in inserting 
nucleotide bases opposite the analog, shown in Table 1 . The Qi, Q5, Qe, Qi6 ? and 
Q18 primers generated detectable true conversion product, however only Qs 
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primers generated almost exclusively true conversion product. No single analog 
functioned as a "universal base" (Hill et al., "Polymerase Recognition of Synthetic 
Oligodeoxyribonucleotides Incorporating Degenerate Pyrimidine and Purine 
Bases," Proc. Natl. Acad. Sci. USA , 95(8):4258-63 (1998)) capable of generalized 
conversion. Unexpectedly, some products contained sequences that were difficult 
to read across the middle four bases, suggesting single base insertions or deletions 
occurred during PCR extension. This was especially prevalent in products 
generated from mismatched natural bases (see below). 

To test the ability of convertides to reduce mismatch extension 
errors, the effects of preconversion PCR cycles on fidelity were assessed. PCR 
products generated from template amplified with only natural base conversion 
primers were compared to products resulting from two initial PCR cycles using 
convertides followed by selective amplification using specific natural base 
primers. Preconversion PCR was performed with primer pairs containing Qs, Q6, 
and Q7 analogs, since these convertides had been shown to be the most efficiently 
extended. To improve overall PCR fidelity and 3 ' mismatch primer extension, 
CiNF buffer, buffer G(f), was used (Day et al., Nucleic Acids Res , (1999)). Nine 
different synthetic duplex templates (SEP. TP. Nos. 17-24^ containing mutated 
Mspl sites were amplified with or without preconversion using 3 ' analog 
preconversion primers. Both natural base conversion primers and 3' analog 
preconversion primers were designed to manipulate the outside bases CCGG of 
the Msp\ position, shown in Figure 6A-B. Some conversions were intended to 
serve as controls. In these cases, the original bases in the template were either 
restored after analog preconversion or never changed with full-length perfect 
match primers. All steps were performed identically between preconversion and 
non-preconversion reactions, except that preconversion reactions used as template 
the product of 2 cycles of convertide PCR for succeeding rounds of amplification, 
while synthetic duplex served as the starting material for PCR with no 
preconversion. In both cases, 3 ' natural base primers were used to selectively 
amplify the desired endproduct. These primers contained non-hybridizing zipcode 
sequences on their 5' ends, which ultimately served as primer binding sites for the 
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final 20-30 cycles of PCR, shown in Figure 6B. Conversion products were 
quantified by LDR, Figure 6C. 

Figure 7 shows the products of the experiment. Overall, natural 
base mismatch conversion generated greater than 80% incorrect conversion 
products, shown in Figure 7 A, lane 9, Figure 7B, lane 1, 3, 5, 7, 15 and 17), but 
preconversion could improve the fidelity and/or the yield of some conversions. In 
general, transversions were difficult to achieve even with preconversion. G— >C 
and A— >C conversion generated very little of the expected product with either the 
natural base or Qe primers (Figure 7A, lanes 1 1-14). Use of Q6 preconversion 
improved the yield of G— >T and A^T conversion products (compare natural base 
conversion in Figure 7B lanes 11,13 with preconversion in lanes 12 and 14). 
In the case of transitions, C— >T conversion produced unexpected one-base 
shortened artifacts with natural base mismatch primers on the CXGG templates 
(Figure 7B, lanes 1, 3, 5, 7, 15, and 17), but the correct products were generated 
when using Qe preconversion, as seen in Figure 7B lanes 2, 4, 6, 8, 16, and 18). In 
addition, Qe primers did improve the yield of the expected T^C conversion 
product, shown in Figure 7A lanes 9 and 10. The controls performed as expected: 
all C^C and T— >T nonconversion reactions worked correctly without convertides, 
seen in Figure 7A lanes 1, 3, 5, 7, 15, and 17, and 5B, lane 9, and the 
corresponding Q6 preconversion products were restored to the original sequence, 
shown in Figure 7A lanes 2, 4, 6, 8, 16, and 18, and Figure 7B, lane 10). In 
summary, preconversion reduced or eliminated artifacts produced by natural 

base C^T and T— >C conversion and facilitated transitions in general. 
Transversions were only partially successful: G— >T and A^T conversions could 
be improved with preconversion, but G— »C and A— >C conversion could not be 
achieved. 

Apparently correct conversions were observed with attempted 
C— >G and C^A transversions, however, carefully designed control templates 
revealed that these "conversions" were artifactual. C^G and C^A conversion 
appeared to be successful for templates containing a central CpG dinucleotide 
(Figure 8A and 8B, lanes 1-3, 13-21). However, the same final conversion 
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products were observed with other templates lacking the central CpG dinucleotide, 
now clearly incorrect. For example, a GCGC product resulted during G 
conversion in reactions where the expected product should have contained T, G or 
A in the second position (Figure 8 A lanes 4-12). Also, an ACGT product resulted 
during a conversion where the expected product should have inserted a non-C base 
in the second position (Figure 8B lanes 4-12, and 22-27). The mismatch primers 
used to alter the outer bases of the recognition site did not reach the central 
dinucleotide, yet these bases were altered. It is doubtful the "successful" 
conversions occurred through the intended mechanism, and thus represent 
fortuitous artifact. The yield of LDR product was low for two palindromic 
templates despite efficient PCR (lanes 22-27 in both Figure 8A and Figure 8B). 
These conversion reaction products presumably contain a large fraction of 
insertions or deletions, which cannot be detected by the current set of LDR probes. 
In summary, C— >G conversion was partially accomplished by both Q5 (Figure 8A, 
lanes 5, 8, 11, and 23) and the natural base G (Figure 8A, lanes 4, 7, 10, and 22); 
however, preconversion does not appear to improve conversion. C— >G conversion 
exhibits sequence dependence. 

The results of the preconversion study indicate that errors in natural 
base conversion were prevalent, but the use of Q5, Q6 and Q7 convertides in 
preconversion reduced polymerase error in certain cases. In terms of conversion 
reactions, transitions were easier to accomplish than trans versions. This is in 
agreement with previous findings: Newton et al. observed more errors in 
extension of primers with 3' terminal C-T, A-A, and T-T mismatches 
(transversions) than with purine-pyrimidine mismatches (transitions) (Newton et 
al., "Analysis of Any Point Mutation in DNA. The Amplification Refractory 
Mutation System (ARMS)," Nucleic Acids Res. 17(7):2503-16 (1989)). Here, 
pyrimidine-pyrimidine conversion usually generated the expected product, 
especially when using convertides. In cases of purine-pyrimidine, and pyrimidine- 
purine conversion, incorrect products were often generated. A summary of results 
is shown in Table 2, below. Formation of incorrect conversion products can be 
explained in part by a transient base-pair slippage of the primer 3' nucleotide (or 
analog) to a misaligned position on the template, as shown in Figure 9. As a 
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result, the sequence following the mismatch is not complementary to the original 
template. Consistent with this hypothesis is the observation of unreadable 
sequence immediately following the analog in initial sequencing experiments. 
Palindromic products, especially CpG dinucleotides, are themselves prone to 
slippage and extension. Palindromic products were frequently produced from 
non-palindromic templates. These artifacts were reduced by the presence of 10% 
formamide in the PCR buffer, presumably through destabilization of misaligned 
structures. Finally, nucleotide analogs produced fewer artifacts than natural bases. 
Different analogs produced different kinds and quantities of artifacts, perhaps 
according to their relative ability to base pair and stabilize a slippage 
misalignment. Thus, if polymerase extension is slow from an analog poorly base- 
paired with the template, extension from a strong transient base pair generated by 
slippage could exceed the rate of extension from a weakly base-paired 3 ' terminal 
base. 
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Table 2 



Starting 
Template? 1 




C 


First base converted to 
T G 


A 


1 


CCGG 


c 


Q6 


Q7 (FP) 


Qs (FP) 


2 


CTGG 


c 


Q6 


X (err C) 


X (err C) 


3 


CGGG 


c 


Q6 


Qs (err C) 


X (err C) 


4 


CAGG 


c 


Q6 


G (err C) 


X (err C) 


5 


TCGA 


Q6 


Tor Q 6 


Qv (FP) 


Qs (FP) 


6 


GCGC 


X (en- G) 


Q6 


G 


Qs or Q 7 


7 


ACGT 


X (en- G) 


Q6 weak 


Q7 


A or Q7 


8 


CATG 


C 


X 


Qs (err C) 


X (err C) 


9 


CGCG 


C 


06 


X 


Q7 (err C) 



a The 50 bp synthetic duplex DNA templates containing p53 sequence spanning codon 248 are 
distinguished by the four bases replacing the Mspl site, which are shown. 

Table 2: For the most effective conversions, see Figures 5 and 6. Nine duplex DNA templates 
were used in conversion reactions. Each contained sequence identical to p53 surrounding codon 
248, except the Mspl site was replaced by a different four base sequence (B1B2B3B4). Bi and B4- 
(opposite strand) were simultaneously converted in turn to C, T, G and A either directly by PCR 
using natural base primers, or by preconversion PCR with nucleotide analog primers followed by 
PCR with natural base primers. In nonconversion control reactions the "conversion" product is 
identical to the original template. A natural base is used to indicate control reactions, and cases in 
which preconversion did not improve conversion. Preconversion was performed using to 
facilitate conversion to C and T, and using Q5 and Q7 to facilitate conversions to G and A. 
Conversion primers detennine Bi and B4; LDR was performed to detect unintended base changes 
in B2 (which ideally is unchanged after conversion). Conversion improved by preconversion is 
indicated by the nucleotide analog used. Preconversion equally as effective in control reactions as 
natural base primers is also indicated by the analog used. Low conversion fidelity results in large 
B2 error. Major B2 error products are identified (e.g. err C indicates C at B2), and the absence of 
correct product indicated no conversion method was successful (X = no correct product). 
Apparently correct product probably formed through a fortuitous mechanism is indicated (FP = 
false positive). 

As discussed earlier, PCR-RPLP has been widely used to detect rare mutations. A 
limitation of this technique is reliance on serendipity to yield mutations that can be 
modified to create restriction sites in either the wild-type or the mutant template. 
A second limitation imposed on this approach is the need to avoid using 3' 
terminal mismatch primers, since extension from these primers is known to be 
error-prone. To date, the majority of successful attempts have used interrupted 
palindromic restriction sites to avoid using 3' terminal mismatch primers. 
Mutations in the cancer causing K-ras and H-ras were detected at a sensitivity of 1 
in 10 5 using PCR/RFLP with interrupted palindromic enzymes Xmnl (Kumar et 
al., "Oncogene Detection at the Single Cell Level," Oncogene . 3(6):647-51 
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(1988)), AlwNl (Anderson et al., "Prevalence of RAS Oncogene Mutation in Head 
and Neck Carcinomas" J. Otolaryngol. . 21(5):321-6 (1992)), and^/NI or Mval 
(Urban et al., "Detection of C-Ki-Ras Mutation by PCR/RFLP Analysis and 
Diagnosis of Pancreatic Adenocarcinomas," J. Natl. Cancer Inst. . 85(24):2008-12 
(1993); and Ronai et al., "Quantitative Enriched PCR (QEPCR), a Highly 
Sensitive Method for Detection of K-Ras Oncogene Mutation," Hum. Mutat . 
10(4):322-5 (1997)). These PCR-RFLP experiments and others (Hattori et al., 
"Mismatch PCR RFLP Detection of DRD2 Ser31 ICys Polymorphism and 
Schizophrenia," Biochem. Biophys. Res. Commun. . 202(2):757-63 (1994); 
Beutler et al., "The Facile Detection of the Nt 1226 Mutation of 
Glucocerebrosidase by 'Mismatched 1 PCR," Clin. Chim. Acta. . 194(2-3): 161-6 
(1990); Hingorani et al., "A Simple Molecular Assay for the CI 166 Variant of the 
Angiotensin II Type 1 Receptor Gene," Biochem. Biophys. Res. Commun, , 
213(2):725-9 (1995); Kuwata et al., J Allergy Clin Immunol . 96(6 Pt 2):1051-60 
(1995); Nishiwaki et al., "Mutational Screening of APP Gene in Patients with 
Early-Onset Alzheimer Disease Utilizing Mismatched PCR-RFLP," Clin. Genet. . 
49(3):1 19-23 (1996); and Ishihara et al., "Analysis of Allelic Variation of the 
TAP2 Gene in Sarcoidosis," Tissue Antigens . 49(2):107-10 (1997)) avoid 3' 
terminal mismatches, however most cancer mutations are in sequences that cannot 
be converted to interrupted palindromes, for example at CpG dinucleotides. 
A larger fraction of mutations would be made into targets for detection if 
contiguous recognition sequences could be introduced with as few errors as 
interrupted palindromic recognition sequences. Currently, contiguous restriction 
sites are introduced by terminal 3' mismatch primer extension, which is prone to 
errors. O'Dell et al. tested a general method for introducing different restriction 
sites at CpG dinucleotides using mismatch PCR (O'Dell et al., "PCR Induction of 
a TaqI Restriction Site at Any CpG Dinucleotide Using Two Mismatched Primers 
(CpG-PCR) ," Genome Res. . 6(6):558-68 (1996)). The outer bases of four 
different CpG dinucleotides in the human LDL receptor gene were altered to 
create Taq\ (TCGA), Mspl (CCGG) or Rhal (GCGC) sites. In these targets, Taql 
sites were successfully generated by 3' T mismatch primers. The method was able 
to detect homozygous and heterozygous individuals; however, the ratio of 
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products representing each allele was not equal, as is expected in germline 
mutations. Several cases have been shown here where T mismatch conversion 
failed to create a Taq\ site. Thus the method is sequence dependent. O'Dell et al. 
found that C and G mismatch conversion failed. These results agree with O'Dell's 
conclusion that stronger base pairing leads to mispriming, possibly through 
stabilization of primer slippage on the template. Gotoda et al., "Detection of Three 
Separate DNA Polymorphisms in the Human Lipoprotein Lipase Gene by Gene 
Amplification and Restriction Endonuclease Digestion," J. Lipid Res. . 
33(7):1067-72 (1992), claim to have successfully used PCR-RFLP to introduce a 
Maell site (ACGT) by extension of a 3 ' C-A mismatch to produce a T— >C 
transition (Gotoda et aL, "Detection of Three Separate DNA Polymorphisms in the 
Human Lipoprotein Lipase Gene by Gene Amplification and Restriction 
Endonuclease Digestion," J. Lipid Res. . 33(7): 1067-72 (1992)). Athma et al. used 
PCR extension of a 3' terminal mismatch primer to create a restriction site for 
discriminating between two alleles (Athma et al., "Single Base Polymorphism 
Linked to the Ataxia-Telangiectasia Locus is Detected by Mismatch PCR," 
Biochem. Biophys. Res. Commun. . 210(3):982-6 (1995)). A G-T mismatch 
produced an Mval site (CC A/T GG) through an A— >G transition. Successful 
A— >G conversions have been performed in accordance with the method of the 
present invention, using a natural base mismatch, but difficulties with T^C 
conversion by natural base primers has been encountered. Transitions can be 
accomplished more readily than transversions, but the yield of correct product can 
be sequence dependent. Others have also found that PCR-RFLP can produce 
false-positive results (Hodanova et al., "Incorrect Assignment of N370S Mutation 
Status by Mismatched PCR/RFLP Method in Two Gaucher Patients," J. Inherit. 
Metab. Pis. . 20(4):61 1-2 (1997)). The use herein of ligase detection reaction 
allows the determination of the precise amounts of misextension products 
generated. 

The fidelity of polymerase extension from primers containing 3 ' 
natural bases and nucleotide analogs has been measured. Results indicate that 
natural base mismatch primer extension cannot be used as a general technique to 
create restriction sites in any given sequence for RFLP analysis. Primer slippage 
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appears to be an important mechanism for producing error in mismatch primer 
extension. This source of error may have a dramatic impact on some allele- 
specific PCR and other methods of high sensitivity mutation detection. With 
further development and testing of nucleotide analogs to facilitate conversion, 
mismatch primer extension may become a technique that can efficiently introduce 
desired mutations with few artifacts. It has been found some nucleotide analogs 
improve mismatch primer extension (see Table 3, below). Further improvement 
of 3 ' mismatch extension will be required to minimize the high degree of context 
dependent error seen in transversions and lead to improved levels of sensitivity 
and broader scope of PCR/RFLP based mutation detection. 

Table 3 
Conversion to 

Starting 



Base C T G A 

C C Q 6 

T Q 6 T 

G G A or Q? 
A Qs orQ? A 



Table 3. Summary of conversion strategy. A Q n convertide indicates preconversion is required 
using the indicated convertide prior to final conversion using natural base primers. In some cases, 
an additional convertide or using only the natural base will result in the desired conversion. 

Example 9 - Oligonucleotide Synthesis for PCR/RE/LDR 

Oligonucleotides were synthesized at the 0.2 \xmole scale by 
cyanoethyl phosphoramidite chemistry on an Applied Biosystems 394 DNA 
synthesizer. Standard 500A CPG columns and reagents (Applied Biosystems) 
were used with the following exceptions: oligonucleotides 50 bases in length 
were synthesized using wide-pore 1000A CPG columns (Applied Biosystems); 
oligonucleotides with fluorescent dye FAM at the 5' terminus were synthesized 
using FAM phosphoramidite (Applied Biosystems) with a 1 5 minute coupling 
step; oligonucleotides with 5' phosphate were synthesized using phosphorylation 
reagent (Glen Research) with a 15 minute coupling step. Oligonucleotides with 3' 
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blocking group were synthesized using 3 '-Spacer CPG columns (Glen Research). 
Oligonucleotides with the 3' nucleotide analog 6-(2-deoxy-B-D-ribofuranosyl)- 
6H ? 8H-3 ? 4-dihydropyrimido[4 ? 5-c][l,2]oxazine-7-one (Qe) were synthesized using 
dP-CpG (Glen Research). 

Example 10 - PCR Polymerases and Buffers for PCR/RE/LDR 

The polymerases used were AmpliTaq (Applied Biosystems), Vent 
and Vent(exo-) (New England Biolabs), and Expand polymerase mix (Taq and 
Pwol polymerase mixture, in Expand High Fidelity kit, Boehringer Mannheim). 
The commercially available PCR buffers used were supplied in the AmpliTaq and 
Expand kits. Tris pH 9.1 (pH values were measured using 1 M stock solutions at 
23°C), tricine pH 8.7, EPPS (N-[2-hydroxyethyl]piperazine-N 5 -3-propanesulfonic 
acid) pH 8.4, and citrate pH 7.6 (Sigma) were used for alternative PCR buffers. 
Unless otherwise noted, each 20 jal reaction contained 20 mM Tris, tricine, or 
citrate, 200 pg/ml bovine serum albumin, 2.5 mM MgC12, 200 jiM dNTP (each) 
and either 16 mM (NH^SCU, or 50 mM potassium acetate. Formamide at 10% 
concentration was used as indicated (see Example 1 1 , below). PCR buffers were 
made as 10% stocks requiring the addition of formamide as needed, dNTPs, and 
the oligonucleotide primers and template DNA. 

Example 11 — Enzyme Buffer Notation 

Test PCR buffers are named to indicate the presence of one or more 
components: Tris/potassium acetate = buffer A; Tris/ammonium sulfate = 
buffer B; Tricine/ammonium sulfate = buffer D; EPPS/potassium sulfate = buffer 
E; EPPS/ammonium sulfate = buffer F; and Citrate/ammonium sulfate = buffer G. 
Component concentrations are described above. 

Example 12 - Amplification of p53 exon 7 from Genomic DNA 

Part of p53 exon 7 surrounding codon 248 was amplified as shown 
in Figure 1 0. The upstream primer has nucleotide sequence corresponding to 
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SEQ. ID. NO. 1^35. as follows: 

5 '-GCCTCATCTTGGGCCTGTGTTATC-3 ' 

hybridized within the preceding intron, and the downstream primer, with a 
nucleotide sequence corresponding to SEQ. ID. NO. S^lfL as follows: 

5 '-GTGGATGGGTAGTAGTATGGAAGAAATC-3 ' 

hybridized within exon 7. All PCR, restriction endonuclease digestion and 
ligation steps described throughout were performed using a GeneAmp PCR 
System 2400 (Perkin-Elmer). Several buffers and enzymes were used as indicated 
in Examples 10 and 11. The p53 exon 7 amplification from genomic DNA was 
performed starting with a 20 \xl reaction mixture containing 50 ng of DNA, 2.5 
mM of each dNTP, and 12.5 pmol of each primer in lx buffer without polymerase. 
The reaction mixture was covered with paraffin oil and preincubated for at least 
1.5 min at 94°C in order to perform hot start by adding 1 jal of polymerase diluted 
in lx buffer to introduce the required units of polymerase. The exon 7 segment 
was amplified for 40 cycles of 94°C 1 5 sec, 65°C 2 min, with an additional 5 min 
at 65°C at the end of the last cycle. PCR amplifications departing from this 
procedure were performed as indicated. 

Example 13 - PCR/RE/LDR: Fidelity Assay 

Templates were amplified with conversion primer pairs bracketing 
the central two basepairs of the Mspl site (CCGG) at codon 248 (Figure 1 IB). 
Tubes were prepared containing 10 frnol per reaction of either PCR amplified p53 
exon 7 or wild-type synthetic duplex template, PCR buffer, and primers. In 
parallel reactions, a synthetic 50 bp duplex marker template (MK), with the 
sequence CGGG replacing the Mspl site at codon 248, was added at 10" 3 , 10" 4 , 10 5 
and 0 molar ratio to wild-type template. Reactions were preincubated for at least 
1 .5 min at 94°C with all components present in CiNF buffer, buffer G(f), except 
Vent (exo-) polymerase. A "hot start" was performed by adding 1 jal of 
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polymerase at 94°C. When preconversion was performed, 2 cycles of 94°C 15 
sec, 55°C 1 min, 60°C 1 min were executed with 500 fmol each of the primers 
p53-248Q 6 and p53-248Q 6 R. Afterwards, 1 pmol of p53Taq248T fSEO. TP. 
No. 30^ and p53Taq248T R fSEO. ID. No. 3n primers were added. When 
preconversion was not performed, the reactions contained 1 pmol each of the 
primers p53Taq248T and p53Taq248TR or the control primers p53Msp248C and 
p53Msp248CR. After reactions with and without preconversion were performed, 
conversion PCR was carried out as follows: 5 cycles of 94°C 15 sec, 55°C + 
1 %yc 1 min (temperature ramp), 60°C 1 min; then, 20 cycles of 94°C 1 5 sec, 
60°C 2 min; then, a final 60°C 5 min extension. After 3 cycles of the temperature 
ramp 10 pmol of long zipcode conversion primers (p53zip248T (SEP. TP. No. 9} 
and P 53zip248TR TSEO. IP. No. 10V or P 53zip248C rSEO. ID. No. 25} and 
p53zip248CR)^ were added. After conversion, the wild-type PNA was digested 
periodically during 20 cycles of "zipcode" PCR (described below). Polymerase 
was inactivated by freezing and thawing twice. Finally, LPR was performed to 
detect the conversion products without contribution from the original template 
(except in nonconversion control reactions). 

Example 14 - PCR/RE/LDR: "Zipcode" PCR 

Wild-type sequences or wild-type conversion products were 
removed by restriction digestion. The appropriate restriction endonuclease was 
added to the reaction tube and supplemented with additional MgCh as required to 
allow efficient digestion. Mspl digestion was performed at 37°C for 15 min using 
no additional MgCl2 3 except when using citrate buffer. Taql digestion was 
performed at 65 °C for 30 min at 6 mM Mg 2+ by adding 1 \i\ of enzyme diluted in 
140 mM MgCl2. The undigested conversion products were reamplified from 1 jliI 
of a 1 Ox dilution added to a 20 ^il PCR reaction containing 10 pmol of the 
"zipcode" primers Ztop fSEO. IP. No. 7^ a nd Zhot f SEP. TP. No. 8^ (Figure 6B). 
These zipcode primers each contain a PNA sequence that is not similar in 
sequence to any genomic sequences present in the sample, thus only the products 
of previous PCR using primers containing the zipcode sequences will be 
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efficiently amplified. Conversion products were amplified using Expand 
polymerase mix and buffer (see Example 1 0) After an initial RE digest, zipcode 
PCR reamplification followed by redigestion was performed as follows: reactions 
were preincubated at 94°C for at least 1.5 min then initiated with a hot start by 
adding 0.1 jal of RE digested sample (1 jlxI of a lOx dilution) to a 20 \i\ reaction; 10 
cycles of 94°C 15 sec, 65°C 2 min. Zipcode PCR amplification products were 
redigested as described above. 

Example 15 - PCR/RE/LDR: Ligase Detection Reaction 

Ligase detection reactions were performed in standard LDR buffer 
(25 mM Tris pH 7.6, 12 mM MgCh, 65 ng/ml bovine serum albumin, 100 mM 
KC1, and 10 mM DTT). Each 20 jil reaction contained approximately 500 finol of 
dsDNA (1 jal of PCR sample), 500 finol of each discriminating probe fSEO. ID. 
Nos. 1 1-14^. and 750 fmol of common probe f SEP. TP. No. 15"L shown in Figure 
11C. Sets of discrimination and common probes were synthesized to detect the 
expected conversion products; i.e., converted to CNGG or TNGA at the Mspl 
position. The common probe was synthesized using 3'-Spacer C3 CPG columns 
and the 5' end was phosphorylated on the column using phosphorylation reagent. 
Discrimination probes of each set varied at the 3' terminal base to query the base 
in that location, i.e., the second base of the Mspl position. Discrimination probes 
had 5' tails of different length and a FAM label for fluorescence detection. The 
tail size identified the probe and allowed physical separation of different LDR 
products on an acrylamide gel. 

The LDR reaction was preincubated for 1.5 min at 94°C prior to 
the addition of 5 nmol Tth ligase enzyme under a layer of mineral oil. 10 LDR 
cycles of 94°C 15 sec, 65°C 2 min. were used. The reactions were then held at 
94°C until cold quenched on ice and stored at -70°C. The LDR products were 
separated on 10% acrylamide gels containing 7M urea with 0.6x TBE (lx TBE 
contains 90 mM Tris base, 90 mM Borate, 2 mM EDTA) used in the gel and for 
the running buffer. Data were collected using an ABI 373 automated DNA 
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sequencer and Applied Biosystems Genescan 672 software (GS Collection and GS 
Analysis). 
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Example 16 - Image Processing for PCR/RE/LDR 

Raw gel pictures were produced by the ABI GS Analysis software. 
Dye specific pictures were opened in Adobe Photoshop 3.0, cropped, resized, and 
converted to grayscale. The grayscale images were opened in NIH Image 1.59, 
inverted and ID vertical background was subtracted. Optionally, NIH Image 
could render a 3-D plot from a corrected 2-D picture. Background corrected 
pictures were reinverted and rendered in pseudocolor by Photoshop by replacing 
the color table to make subtle intensity differences easier to compare. Except for 
color replacement, only linear image processing was performed in order to 
preserve relative intensities. 

Example 17 -PCR/RE/LDR Optimization 

In Figure 10, PCR/RE/LDR was developed to detect and identify 
low abundance mutations occurring within the Mspl site (CCGG) at codon 248 in 
the p53 gene. An initial PCR amplifies exon 7 from genomic DNA, shown in 
Figure 10A. This product serves as the template for a second PCR that amplifies 
the central CpG dinucleotide in the Mspl site. To generate a restriction site in 
sequence lacking a preexisting site, mismatch primers are used to alter one or 
more bases flanking the CpG dinucleotide. This results in a conversion PCR that 
creates a restriction site (NCGN — > TCGA) Taql site, for example, as in Figure 
10B. In a generalized method for introducing contiguous Type II restriction sites, 
conversion PCR primers by necessity have 3' terminal mismatches. To avoid 
unfavorable natural base mismatches that may result in insertion of an erroneous 
base at the next site (O'Dell et al., Genome Res. . 6(6):558-68 (1996); and Eiken et 
al., "Application of Natural and Amplification Created Restriction Sites for the 
Diagnosis of PKU Mutations," Nucleic Acids Res. . 19(7): 1427-30 (1991)), 
preconversion with 3' nucleotide analog primers is performed. However, 
extension with 3' analog primers produces a pool of degenerate products (Day et 
al., Nucleic Acids Res. > (1999)). Thus, after this preconversion step, natural base 
primers are used to selectively amplify the desired products. 

Mismatch conversion error relative to PCR error was assessed by 
performing parallel nonconversion control reactions and true conversion reactions 
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with and without preconversion. Nonconversion reaction products retained the 
Mspl site (CCGG) (see Figure 1 OA), while conversion introduces a Taql site 
(TCGA) (see Figure 10B). All PCR/RE/LDR steps were performed under similar 
conditions, varying only the primers and restriction endonuclease (Mspl or Taql). 
In both cases, noncleavable DNA is preferentially amplified. When wild-type 
DNA is selectively removed by digestion, it is necessary to determine the 
proportion of DNA with incorrect sequence produced relative to the initial 
quantity of DNA in the sample, which is nearly 100% wild-type. Parallel 
reactions were performed in which known fractions of marker (MK) DNA were 
present. The MK DNA contained a single base change in the Mspl site (CG GG), 
rendering it uncleavable by Mspl. C^G transversions are unlikely to occur 
through polymerase error. The MK standard curve allows quantification of 
mutations detected by LDR. Figure 10C shows the LDR quantification of mutant 
sequencing using a marker. PCR conditions were tested to minimize PCR error 
(observed in the nonconversion reactions) and mismatch extension errors 
(additional errors observed in the conversion reactions). 

Various proofreading and nonproofreading polymerases were 
tested, as different polymerase properties are required during target amplification 
from genomic DNA, conversion, and reamplification steps in PCR/RE/LDR. 
Since it is essential throughout PCR/RE/LDR to minimize any alteration of the 
bases assayed by LDR, proofreading polymerases might seem the logical choice 
for maintaining the highest fidelity (Keohavong et al., PCR Meth. AppL . 2:288-92 
(1993)); however, they may interfere with conversion by mismatch primer 
extension. Hence, PCR conditions must be found which maximize the fidelity of 
nonproofreading polymerases (Keohavong et al., Proc. Natl. Acad. Sci. USA . 
86)23):9253-7(1989)). 

Initially, PCR/RE/LDR was used as a high sensitivity assay to 
determine PCR conditions that maintain the highest fidelity throughout the 
procedure. Two main sources of error were expected: 1) polymerase 
misincorporation, and 2) DNA template degradation. Raising the PCR buffer pH 
improves long PCR, probably by decreasing depurination which leads to strand 
cleavage (Barnes, "PCR Amplification of up to 35-Kb DNA with High Fidelity 
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and High Yield from Lambda Bacteriophage Templates," Proc. Natl. Acad. Sci. 
USA . 91(6):2216-20 (1994); Cheng et al., "Effective Amplification of Long 
Targets from Cloned Inserts and Human Genomic DNA," Proc. Natl. Acad. Sci. 
USA . 91(12):5695-9 (1994); and Sang et al. 5 "Generation of Site-Directed 
Mutagenesis by Extralong, High-Fidelity Polymerase Chain Reaction," Anal. 
Biochem. . 233(1): 142-4 (1996)). While higher pH may decrease template 
damage, higher pH is also known to adversely affect polymerase fidelity (Eckert et 
al., "High Fidelity DNA Synthesis by the Thermus Aquaticus DNA Polymerase," 
Nucleic Acids Res. . 18(13):3739-44 (1990); Eckert et al., "DNA Polymerase 
Fidelity and the Polymerase Chain Reaction," PCR Methods AppL . 1(1): 17-24 
(1991); and Cline et al., "PCR Fidelity of Pfii DNA Polymerase and Other 
Thermostable DNA Polymerases" Nucleic Acids Res. . 24(18):3546-51 (1996)). 
Therefore, other buffers were also tested: tricine, EPPS, and citrate buffers, which 
have pKaS in the range of 7 to 8 and | ApKa| lower than Tris. Tris cannot meet the 
dual constraints of a slightly alkaline pH at high temperature to maintain template 
integrity and neutral pH at the extension temperature to maintain polymerase 
fidelity, although most PCR fidelity and long PCR studies use Tris. Some 
investigators have explored the use of alternative buffers with lower |ApKa| 
(Eckert et al., "High Fidelity DNA Synthesis by the Thermus Aquaticus DNA 
Polymerase," Nucleic Acids Res. . 18(13):3739-44 (1990); Eckert et aL, "DNA 
Polymerase Fidelity and the Polymerase Chain Reaction," PCR Methods Appl. , 
l(l):17-24 (1991); and Brail et al., "Improved Polymerase Fidelity in PCR- 
SSCPA," Mutat. Res. . 303(4):171-5 (1993)). Buffer-specific effects on PCR were 
tested for purposes of optimizing the buffer conditions of the present invention by 
making PCR buffers containing identical components except for the buffering 
compound. Making one set of test PCR buffers with ammonium sulfate and 
another with potassium acetate tested salt effects. The ApK* of each buffer was 
determined in pure solution and in lx PCR buffer mixtures (data not shown). 
Results of these tests agreed with published ApKa values of pure buffers (Good, 
"Amine Buffers Useful for Biological Research," in Fasman, ed., Handbook of 
Biochemistry and Molecular Biology , Cleveland, Ohio: CRC Press, Inc., pp. 367- 
369 (1976); and Blanchard, "Buffers for Enzymes," Meth. Enzymol. . 104:404-14 
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(1984)) corrected by a small constant (.005 pH units/°C), possibly due to a 
temperature-dependence of the pH probe itself. The pH of the each test PCR 
buffer was adjusted to produce approximately neutral pH at 65°C. However, the 
1 x PCR buffers had somewhat different ApKaS compared to the pure buffers; for 
example, lx TsN, buffer B, had ApKa = -.033/°C versus -.030/°C for 100 mM Tris, 
and lx TcK, buffer C, had ApKa = -.022/°C versus -.025/°C for 100 mM tricine. 

Test PCR buffers containing Tris, tricine, or EPPS were used to test 
PCR fidelity with no conversion of the Mspl site (CCGG) at codon 248 of p53 
(Figure 12). The objective in this experiment was to test the error rate of PCR 
using various buffers and polymerase enzymes. Since introduced errors create 
template that cannot be cleaved by the selected restriction enzyme, false-positives 
accumulate as this error template continues to amplify alongside true mutant 
DNA. This established the conditions necessary to achieve amplification while 
minimizing error. The same polymerase and buffer set was used in both 
preamplification of p53 exon 7 from genomic DNA and in the "conversion" step. 
As mentioned, the "conversion" step maintains the Mspl site by using perfectly 
matched primers whose 3 ' ends terminate on the C and G bases flanking the 
central CpG. After an initial Mspl digest, template and amplification products 
were periodically redigested every ten cycles during reamplification to remove 
WT sequence. Synthetic marker mutant MK with the sequence CGGG was 
present in these reactions at 10~ 3 , 10" 4 , or 10~ 5 and 0 ratio to wild-type (WT). MK 
will not be cleaved by Mspl restriction digestion, but will amplify with each PCR 
cycle to provide an internal control to measure product quantities (see below). 
The MK product will also maintain its sequence, as the perfect match primers in 
the conversion step will again terminate on the C and G bases flanking the central 
GG. Error products resulting from MK PCR will in general lack Mspl sites and 
will be indistinguishable from regular MK template. If an Mspl site is accidentally 
created, the product will be destroyed by digestion. If false LDR error products 
are also generate, these can be detected by comparison with a "no marker" control. 

For each buffer, LDR detected MK products in each of the four 
parallel reactions, with the 0 MK control indicating the background level of 
CGGG error produced. The intensities of other error products detected by LDR 
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were compared to MK to estimate the fraction of each error product generated. 
AmpliTaq generated few transversions (C->G or C— »A), but a large amount of 
C^T transition was observed, as shown in Figure 12 A. Vent generated much less 
of the C— >T transition compared to AmpliTaq (Figure 12B). AmpliTaq showed 
little dependence on the presence of potassium acetate in buffers A, C, and E 
(Figure 12A lanes 1-4, 9-12, 17-20) versus ammonium sulfate in buffers B, D, and 
F (Figure 12A lanes 5-8, 13-16, 21-24). Vent polymerase amplified template 
more efficiently in Tris/ammonium sulfate buffer B than Tris/potassium acetate 
buffer A (Figure 12B lanes 1-4 versus lanes 5-8), as described previously (Barnes, 
"PCR Amplification of up to 35-Kb DNA with High Fidelity and High Yield from 
Lambda Bacteriophage Templates," Proc. Natl. Acad. Sci. USA . 91(6):2216-20 
(1994); Keohavong et al., PCR Meth. AppL , 2:288-92 (1993); Cariello et al., 
"Fidelity of Thermococcus Litoralis DNA Polymerase (Vent) in PCR Determined 
by Denaturing Gradient Gel Electrophoresis," Nucleic Acids Res. . 19(15):4193-8 
(1991); and Mattila et al., "Fidelity of DNA Synthesis by the Thermococcus 
Litoralis DNA Polymerase— An Extremely Heat Stable Enzyme with Proofreading 
Activity." Nucleic Acids Res. . 19(18):4967-73 (1991)). However, Vent exhibited 
improved fidelity in tri cine/potassium acetate buffer C (Figure 12B lanes 9-12) 
and EPPS buffer E (Figure 12B lanes 17-20) compared to tricine/ammonium 
sulfate buffer D (Figure 12B lanes 13-16) and EPPS/ammonium sulfate buffer F 
(Figure 12B lanes 21-24). 

The relative fidelities of the different polymerase-buffer 
combinations may be described by their "sensitivity" expressed as the — logio of the 
ratio of MK to WT initially present. The C— >T error for AmpliTaq amplification 
in Tris/potassium acetate buffer A can be taken as an example. If the signal for 
the CTGG error product (Figure 1 2A lane 2) is compared to the MK CGGG signal 
(Figure 12A lanes 1-3), the intensity of the signal most resembles the 10~ 3 MK:WT 
dilution (Figure 12A lane 1). Thus, the C— »T error rate is 10" 3 ; the sensitivity is 3, 
since -log[MK/WT] = -log[10" 3 ] = 3. From this it can be seen that the higher the 
sensitivity, the lower the error rate. Reactions with higher sensitivities for each 
mutation had the highest overall fidelity (Results summarized in Table 1). Many 
of the Vent reactions had sensitivities of 5 (i.e., 1 in 10 5) for every mutation 
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(Figure 12B), while the AmpliTaq' reactions had sensitivities of 3 (i.e, 1 in 10 3)) > as 
shown in Figure 12A. Sensitivity indicates the usefulness of the assay rather than 
the error rate of the polymerase. Error (ER) per base per cycle may be estimated 
from the fraction (F) of all mutations occurring at one base which accumulated 
over 65 cycles (D) before digestion. For purposes of the present invention, the 
number of cycles is an estimate of the number of duplications, since multiple non- 
saturating PCRs were performed. From ER— F/D, Vent polymerase had an error 
rate of less than 1 x 10~ 7 /base/cycle in tri cine/potassium acetate buffer C, 
approximately 2 x 10" 7 /base/cycle in tricine/ammonium sulfate buffer D, and 2 x 
10" 6 /base/cycle in TsN, buffer B. An error rate of 2 x 10~ 5 /base/cycle was 
observed. This was due mainly to the C— >T transition for AmpliTaq in 
tris/potassium acetate buffer A. Elimination of this artifact could improve 
AmpliTaq fidelity by more than 10-fold. Others have used cloning and screening 
methods to estimate polymerase error (Eckert et al., "High Fidelity DNA Synthesis 
by the Thermus Aquaticus DNA Polymerase," Nucleic Acids Res. . 
18(13):3739-44 (1990); Cline et al., "PCR Fidelity of Pfu DNA Polymerase and 
Other Thermostable DNA Polymerases," Nucleic Acids Res. , 24(1 8):3546-5 1 
(1996); Mattila et al., "Fidelity of DNA Synthesis by the Thermococcus Litoralis 
DNA Polymerase— An Extremely Heat Stable Enzyme with Proofreading 
Activity," Nucleic Acids Res. , 19(18):4967-73 (1991); and Huang et aL, "Fidelity 
and Predominant Mutations Produced by Deep Vent Wild-Type and Exonuclease- 
Deficient DNA Polymerases During in Vitro DNA Amplification," DNA Cell 
Biol. , 15(7):589-94 (1996)), and denaturing gradient gel electrophoresis (DGGE) 
has also been used (Keohavong et aL, PCR Meth. Appl. . 2:288-92 (1993); 
Keohavong et al., Proc. Natl. Acad. Sci. USA . 86)23):9253-7 (1989); Cariello et 
al., "Fidelity of Thermococcus Litoralis DNA Polymerase (Vent) in PCR 
Determined by Denaturing Gradient Gel Electrophoresis," Nucleic Acids Res. . 
19(15):4193-8 (1991); and Ling et al., "Optimization of the Polymerase Chain 
Reaction with Regard to Fidelity: Modified T7, Taq, and Vent DNA 
Polymerases," PCR Methods Appl. . l(l):63-9 (1991)). However, these methods 
do not directly measure mutated DNA, and do not detect all mutations. By 
cloning and DGGE methods, Vent polymerase has an error rate estimated from 0.3 
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to 4 xlO 5 /base/cycle (Cariello et al., "Fidelity of Thermococcus Litoralis DNA 
Polymerase (Vent) in PCR Determined by Denaturing Gradient Gel 
Electrophoresis," Nucleic Acids Res. . 19(15):4193-8 (1991); and Ling et al., 
"Optimization of the Polymerase Chain Reaction with Regard to Fidelity: 
Modified T7, Taq, and Vent DNA Polymerases," PCR Methods Appl. . l(l):63-9 
(1991)). The error rate of Tag polymerase has been estimated from 0.8 to 9 xlO 5 
/base/cycle (Eckert et al., "High Fidelity DNA Synthesis by the Thermus 
Aquaticus DNA Polymerase." Nucleic Acids Res. . 18(13):3739-44 (1990); Cline 
et al., "PCR Fidelity of Pfu DNA Polymerase and Other Thermostable DNA 
Polymerases," Nucleic Acids Res. . 24(18):3546-51 (1996); and Brail et al., 
"Improved Polymerase Fidelity in PCR-SSCPA," Mutat. Res. , 303(4): 171 -5 
(1993)), comparable to the error rate we observed for AmpliTaq in TsK, buffer A. 
Of the thermostable polymerases, Pfu has the lowest reported error rate estimated 
from 0.7 to 1.6 XlO" 6 /base/cycle (Cline et al., "PCR Fidelity of Pfu DNA 
Polymerase and Other Thermostable DNA Polymerases," Nucleic Acids Res. , 
24(1 8):3 546-51 (1996); Lundberg et al., "High-Fidelity Amplification Using a 
Thermostable DNA Polymerase Isolated from Pyrococcus Furiosus," Gene , 
108(1): 1-6 (1991); and Andre et al., "Fidelity and Mutational Spectrum of Pfii 
DNA Polymerase on a Human Mitochondrial DNA Sequence," Genome Res. , 
7(8):843-52 (1997)). Pfu polymerase may also exhibit improved fidelity in tricine 
or other low |ApKa| buffers. 

While high fidelity proofreading enzymes appeared to improve 
amplification from genomic DNA, proofreading still must be avoided in the 
conversion step. Different high fidelity genomic amplification conditions were 
tested in combination with fixed conversion conditions. Genomic amplification 
was performed with either Vent(exo-) Citrate/ammonium sulfate, buffer G, or 
Vent(exo-) Citrate/ammonium sulfate, buffer G, with 1 0% formamide buffer G(f) 
(see Table 4). Nonconversion primers were used with Vent(exo-) to optimize 
PCR fidelity in anticipation of conversion by mismatch primer extension. The 
highest fidelity conditions were as follows: Genomic amplifications with Vent/ 
Buffer G were initiated by spiking genomic amplification product from Expand/ 
Buffer C with 1 0% formamide reactions after 3 cycles. These Vent/ Buffer G 
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was provided. See Table 4 for observed error rate with other conditions tested. 



-74- 



Table 4 
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a Based on minimum of 50 total cycles, i.e. observed error +50. 

b No vent PCR product from genomic DNA. Used Taq amplified product for Vent conversion PCR, 
c Template added by taking 1 u.1 after 3rd PCR cycle from a parallel genomic DNA amplification using 
Expand polymerase mix in Buffer C. 



Table 4. Comparison of fidelity using proofreading and nonproofreading polymerases in different buffers for 
PCR to amplify the target sequence from genomic DNA and for conversion PCR. Taq and Vent polymerases 
were initially tested using one buffer for genomic amplification and conversion. During the conversion step, 
only nonconversion of the Msp\ site near p53 codon 248 was performed using short perfect match primers 
(Figure 1 1 A) to determine the background level of polymerase error. LDR quantified Msp\ site mutations at 
the second position (CCGG — » CNGG). Fidelity was compared in parallel reactions using proofreading and 
nonproofreading polymerases in genomic amplification and conversion. Expand polymerase mix, Taq with 
proofreading Pfu polymerase added, was used to initiate target amplification from genomic DNA for 
subsequent high fidelity Vent polymerase PCR. Vent polymerase was substituted with nonproofreading 
Vent(exo-) to determine whether proofreading was required and also in the conversion step where 
proofreading is not permitted. The effect of 1 0% formamide in the conversion PCR buffer was also tested. 
All buffers contained 200 ug/ml bovine serum albumin, 2.5 mM MgCh, and 200 uM dNTP (each). Specific 
components were: A (TsK), 20 mM Tris pH 9. 1 , 50 mM potassium acetate (standard Taq polymerase buffer). 
B (TsN): 20 mM Tris pH 9.1, 16 mM ammonium sulfate (standard Vent polymerase buffer). C (TcK): 20 
mM tricine pH 8.7, 50 mM potassium acetate. D (TcN): 20 mM tricine pH 8.7, 16 mM ammonium sulfate. 
E (EpIC): 20 mM EPPS pH 8.4, 50 mM potassium acetate. F (EpN): 20 mM EPPS pH 8.4, 16 mM 
ammonium sulfate. G (CiN): 20 mM citrate pH 7.6, 16 mM ammonium sulfate. (0 indicates presence of 
1 0% formamide. (4) indicates increase to 4 mM MgCh. (8) indicates increase to 8 mM MgCh- 



PCR conditions were found for each step in PCR/RE/LDR that 
maintain high fidelity when no mismatch conversion was performed. Using 
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known high fidelity PCR conditions, conversion was also tested by changing the 
p53 codon 248 Mspl site (CCGG) into a Taql site (TCGA). MK (CGGG) was 
added as before in parallel reactions to measure fidelity relative to the initial wild- 
type DNA present. High fidelity PCR was performed as described above and 
some (but not all) reactions were subjected to preconversion. Preconversion was 
performed using primers containing the degenerate pyrimidine nucleotide analog 
Qe at their 3' ends, as shown in Figure 10B. The final conversion was 
accomplished using natural base 3' T mismatch primers. Products were detected 
using LDR to interrogate the second base position in the Mspl, Taql and MK 
sequence: CNGG or TNGA. Fidelity for conversion with and without 
preconversion was compared to a nonconversion control. Successful conversion 
will change the Mspl site (CCGG) into a Taql site (TCGA); MK will also be 
converted from CGGG to TGGA. However, the main issue of conversion success 
is the maintenance of the central bases in all cases: CpG for Taql conversions and 
GpG for MK. Figure 13 shows the results of conversion. In Figure 13, lanes 1-4 
(C:G) are nonconverted reactions that were digested with Mspl; lanes 5-8 (Qe:G) 
are preconverted/converted reactions that were digested with Taql; lanes 9-12 
(T:G) are converted reactions lacking preconversion that were digested with Taql. 
PCR/RE/LDR with no conversion was sensitive to better than 1 in 1 0 4 using the 
previously determined best conditions for preamplification and conversion shown 
in Figure 13, C:G lanes 1-4. PCR/RE/LDR with conversion of the Mspl site to a 
Taql site by T mismatch primers was apparently very successful at first glance, 
Figure 13, T:G lanes 9-12. As would be expected for successful conversion, no 
Mspl product can be detected in the _CG_ region of the figure; hence, it appears 
that the site was converted to Taql and then digested. However, although a very 
large fraction of MK (CGGG) is observed in the reactions with added MK, 
Figure 13, T:G lanes 9-11; the same large fraction is also observed in the 0 MK 
control lane in Figure 13, T:G lane 12. Thus, the entire quantity of MK is an 
artifact produced by mismatch extension of the natural base T primers. This event 
would convert the second position C in the Mspl site to a G during extension, 
mimicking the internal sequence of MK (CCGG— >TGGA). Preconversion with 
primers eliminates the MK artifact, Qe:G lanes 5-8. 
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The greater amount of WT present in noncon verted samples, shown 
in Figure 13 lanes 1-4, compared to Qe converted samples, lanes 5-8, may be due 
to inhibition of Mspl digestion by formamide. Formamide apparently inhibits 
Mspl digestion as evidenced by the presence of strong wild-type LDR bands (WT) 
in the nonconversion control, C:G lanes, which are not present after Taql digestion 
of the converted sequence, Qe:G and T:G lanes. 

The low amount of MK product seen in the Qe 10" 4 and 10" 5 MK 
lanes (Figure 13, Qe:G lanes 6, 7) compared to the respective nonconversion 
control reactions, (Figure 13,C:G lanes 2, 3) maybe due to low efficiency of MK 
conversion. The production of a Taql site actually requires two conversions, one 
on each side of the central CpG dinucleotide. Lowering the concentration of MK 
10-fold may reduce MK conversion far more than 1 0-fold. Thus, with only one 
side of the MK sequence converted in a large amount of its product, one half of 
the LDR probes will be unable to properly hybridize to this sequence and ligation 
will not occur. LDR detection will only reveal the lesser quantity of fully 
converted template. Nevertheless, the amount of MK product is greater than the 
control in these two lanes. Compare Figure 13, lane 8 to Figure 13, lanes 6 and 7. 
While formamide may reduce conversion efficiency, conversion fidelity is greatly 
improved. 

Newton et al., "Analysis of Any Point Mutation in DNA. The 
Amplification Refractory Mutation System (ARMS)," Nucleic Acids Res. . 
17(7):2503-16 (1989) found that C • T, A • A, and T • T mismatches were all far 
more difficult to extend with Taq polymerase than purine-pyrimidine mismatches. 
These results reflect PCR efficiency of extension rather than fidelity. Others have 
observed low fidelity in extending natural base mismatches (O'Dell et al., Genome 
Res. , 6(6):558-68 (1996); and Eiken et al., "Application of Natural and 
Amplification Created Restriction Sites for the Diagnosis of PKU Mutations," 
Nucleic Acids Res. , 19(7):1427-30 (1991)). Use of a nucleotide analog with 
structural similarities to multiple bases could potentially be used to allow 
polymerase extension (reading) from the analog when paired with different bases 
and insertion of different bases opposite the analog (writing). For the purposes of 
the present invention, the efficiency of the preconversion process need not 
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necessarily be high. However, successful conversion requires high PCR fidelity to 
ensure that only the bases targeted for conversion are altered. False-positive 
mutation artifacts will result from alterations of bases not targeted for conversion 
within the sequence probed for mutations. Preconversion using 3' Qe primers 
forming a Q6:G mismatch avoids starting polymerase extension with a G • T 
mismatch. In subsequent amplification cycles, A is apparently written frequently 
opposite Q6. This observation is consistent with the results of Hill et al. in which 
Q6 base-paired like C and T with nearly equal frequency (Hill et al., "Polymerase 
Recognition of Synthetic Oligodeoxyribonucleotides Incorporating Degenerate 
Pyrimidine and Purine Bases," Proc. Natl. Acad. Sci. USA . 95(8):4258-63 
(1998)). Facile tautomerization allows Qe to mimic either pyrimidine when base- 
paired and avoids mismatch wobble. When the natural base primer is added after 
preconversion, a significant quantity of perfect match template already exists, 
otherwise the MK artifact would appear in the reaction regardless of 
preconversion. Other nucleotide analogs in addition to Q6 may serve as a bridge 
for more efficient conversions (Day et al., Nucleic Acids Res. , (1999)). 

The fidelity of polymerase extension from primers in PCR has been 
measured, and conditions were found, which in some cases improve fidelity. 
Presumably, higher fidelity resulted from a decrease in polymerase 
misincorporation, primer slippage, and template degradation. PCR/RE/LDR 
allows the measurement of very low-level "mutant" sequences by preferentially 
amplifying non-wild-type sequences. The method of the present invention clearly 
demonstrates that natural base mismatch primer extension cannot be used as a 
general technique to create restriction sites at will in any sequence for RFLP 
analysis. As shown in Figure 13 and observed previously (O'Dell et al., Genome 
Res. , 6(6):558-68 (1996); and Eiken et al., "Application of Natural and 
Amplification Created Restriction Sites for the Diagnosis of PKU Mutations," 
Nucleic Acids Res. . 19(7):1427-30 (1991)), natural base mismatch extension is 
prone to error. To perfectly engineer a restriction site from existing sequence, an 
error-free approach is required. The results of these examples, using the method 
of the present invention, indicates that the use of nucleotide analogs combined 
with high fidelity PCR conditions may radically decrease errors. Monitoring the 
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true specificity of primer extension was possible in these studies because LDR can 
measure specific PCR errors accurately and with high sensitivity. Thus, the 
products of different polymerases and buffers could be assayed at different steps 
during PCR/RE/LDR to maximize both PCR efficiency and fidelity. As a result, a 
PCR/RE/LDR strategy could be assembled to achieve the goal of 1 0 5 sensitivity. 
However, this highest sensitivity was achieved only in the special case of no 
conversion at a preexisting Mspl site. At this time, primer slippage remains an 
important mechanism through which mismatch primer extension errors can arise 
(Day et al., Nucleic Acids Res. . (1999)). Although the importance of this source 
of error in vivo is uncertain, it may have a dramatic impact on allele-specific PCR 
and other in vitro methods of mutation detection. An additional source of error 
arises from using natural base primers to select specific sequences for 
amplification following preconversion with nucleotide analogs. This is because a 
fraction of the selective natural base primers may form a mismatched pair with 
bases other than the intended base. It is known that a characteristic set of different 
bases insert opposite nucleotide analogs (Day et al., Nucleic Acids Res. . (1999); 
and Hill et al., "Polymerase Recognition of Synthetic Oligodeoxyribonucleotides 
Incorporating Degenerate Pyrimidine and Purine Bases," Proc. Natl. Acad. Sci. 
USA . 95(8):4258-63 (1998)). Thus, a high fidelity mismatch primer extension 
protocol awaits the development of new convertides that can overcome these 
problems. In combination with high fidelity PCR and LDR monitoring of 
efficiency, mismatch primer extension may become a technique for the precise 
introduction of desired mutations without artifacts. 

Although the invention has been described in detail for the purpose 
of illustration, it is understood that such details are solely for that purpose. The 
variations can be made therein by those skilled in the art without departing from 
the spirit of the scope of the invention which is defined by the following claims. 



WHAT IS CLAIMED 



hr A m e thod for identifying on e or mor e low abundance 
sequences diff e ring by one or mor e singl e bas e changes, insertion s , or d e l e tions, 
from a high abundance sequence in a plurality of target nucleotide sequ e nc e s 
comprising: 

providing a sampl e potentially containing one or more low 
abundanc e target nucleotid e sequ e nc e s with at least one s e qu e nc e differenc e each 
from th e high abundance targ e t s e qu e nc es ; 

providing a primary oligonucl e otid e prim e r set charact e riz e d by (a) 
a first oligonucleotide prim e r containing a target specific portion, and (b) a s e cond 
oligonucl e otid e primer containing a targ e t specific portion, wh e r e in th e primary 
oligonucl e otid e primers are suitable for hybridization on compl e m e ntary strands 
of a corresponding high and low abundance target nucleotide sequ e nc e s to permit 
formation of a polymerase chain r e action product, but hav e a mismatch which 
interferes with formation of such a polym e rase chain reaction product when 
hybridized to any oth e r nucleotid e sequence present in the sampl e ; 

providing a polymerase; 

blending the sample, the primary oligonucleotide prim e rs, and the 
polymerase to form a primary polym e ras e chain r e action mixtur e ; 

subj e cting th e primary polym e ras e chain r e action mixtur e to two or 
mor e polym e ras e chain reaction cycl e s comprising a denaturation treatment, 
wh e rein hybridiz ed nucl e ic acid s e qu e nc e s ar e separat e d, a hybridization 
treatm e nt, wherein th e targ e t specific portions of th e primary oligonucl e otid e 
primers hybridiz e to the target nucl e otide sequences, and an extension tr e atm e nt, 
wh e r e in the hybridized primary oligonucleotid e primers are e xtended to form 
primary e xt e nsion products complementary to th e targ e t nucleotid e se qu e nc e to 
which th e primary oligonucl e otid e primer is hybridiz e d; 

providing a s e condary oligonucl e otid e prim e r set characteriz e d by 
(a) a first oligonucleotide prim e r, having a targ e t specific portion and a 5* 
up s tr e am secondary primer sp e cific portion, and (b) a s e cond oligonucleotide 
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primer, having a target sp e cific portion and a 5 f upstream secondary primer 
specific portion, wher e in the secondary oligonucleotide primers in a particular set 
are s uitabl e for hybridization on complementary s trands of th e primary extension 
products to permit formation of a s e condary polymerase chain reaction product 
which contains or creates a restriction endonucl e as e recognition s it e wh e n 
amplifying th e high abundance target, but does not contain or creat e a restriction 
e ndonucleas e recognition site when amplifying the one or more low abundanc e 
targ e ts; 

providing a polymerase; 

bl e nding the primary e xt e nsion products, the secondary 
oligonucleotide primers, and the polymeras e to form a secondary polym e ras e 
chain reaction mixture; 

subjecting the secondary polymerase chain r e action mixture to two 
or mor e polym e rase chain reaction cycles comprising a denaturation tr e atment, 
wher e in hybridized nucl e ic acid sequences ar e s e parat e d, a hybridization 
tr e atment, wh e r e in th e secondary oligonucl e otid e prim e rs hybridiz e to th e primary 
e xtension products, an e xtension treatment, wher e in th e hybridiz e d s e condary 
oligonucleotide primers are e xt e nded to form s e condary e xt e nsion products 
complementary to the primary e xt e n s ion products, wh e r e in high abundanc e 
s e condary e xt e nsion products contain a restriction sit e but low abundanc e 
s e condary e xt e nsion products do not; 

providing a r e striction e ndonucleas e ; 

bl e nding th e secondary ext e nsion product and th e restriction 
endonucl e as e to form an e ndonucl e as e dig e stion reaction mixtur e ; 

subj e cting th e e ndonucleas e digestion r e action mixtur e to an 
endonucl e as e digestion reaction such that the r e striction e ndonucl e ase r e cognizes 
and cl e aves th e r e striction endonucl e as e r e cognition sit e contained within or 
cr e at e d wh e n amplifying th e high abundanc e targ e t but not the low abundanc e 
targ e t in the secondary e xt e n s ion products, thus s e l e ctiv e ly destroying th e high 
abundanc e s e condary e xtension product s ; 

providing a t e rtiary oligonucl e otid e prim e r sot charact e riz e d by (a) 
a fir s t tertiary prim e r containing th e sam e s e quenc e as th e 5 1 upstr e am portion of 
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th e first oligonucl e otide primer of tho s e condary oligonucleotid e primer s e t, and 
(b) a second tertiary primer containing the same sequenc e as th e 5 T upstream 
portion of a second oligonucleotide primer of the s e condary oligonucleotide 
primer set, wh e rein the set of tertiary oligonucleotide prim e rs may be used to 
amplify all of the secondary e xt e nsion products; 

blending th e secondary e xt e nsion products, the tertiary 
oligonucleotide prim e r s e t, and the polymeras e to form a tertiary polym e ras e chain 
reaction mixture; 

subjecting the tertiary polymerase chain reaction mixture to two or 
more polymerase chain r e action cycles comprising a denaturation treatment, 
wh e r e in hybridized nucleic acid sequences ar e se parated, a hybridization 
tr e atment, wher e in th e t e rtiary oligonucleotid e prim e rs hybridiz e to th e secondary 
e xtension products, an e xtension tr e atment, wherein th e hybridized t e rtiary 
oligonucleotide primers are ext e nded to form t e rtiary e xt e nsion products 
complementary to the s e condary e xt e nsion products; 

providing a plurality of oligonucl e otide probe sets, each set 
characterized by (a) a first oligonucleotide prob e , having a tertiary e xtension 
product s pecific portion and a detectable report e r label, and (b) a s e cond 
oligonucl e otide probe, having a tertiary e xt e n s ion product - specific portion, 
wher e in th e oligonucleotide probes in a particular set are suitable for ligation 
together wh e n hybridized adjac e nt to one another on a compl e mentary tertiary 
e xt e nsion product sp e cific portion, but have a mismatch which interferes with 
such ligation when hybridiz e d to any other nucl e otid e s e qu e nce present in th e 
sampl e ; 

providing a ligase; 

blending th e t e rtiary e xtension product, th e plurality of 
oligonucleotid e probe sets, and th e ligas e to form a ligas e d e t e ction r e action 
mixtur e ; 

subj e cting th e ligase detection r e action mixtur e to one or mor e 
ligas e d e tection reaction cycl e s comprising a d e naturation tr e atm e nt, wh e r e in any 
hybridiz e d oligonucleotid e s ar e s e parat e d from th e tertiary extension products, and 
a hybridization treatm e nt, wh e r e in th e oligonucl e otid e prob e s e ts hybridiz e at 
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adjac e nt positions in a bas e specific manner to their resp e ctive t e rtiary extension 
products, if present, and ligat e to on e another to form a ligation product s e qu e nce 
containing (a) th e detectable r e porter label and (b) th e t e rtiary e xt e nsion product 
specific portions conn e ct e d togeth e r, wherein th e oligonucleotid e prob e s e ts may 
hybridiz e to nucl e otid e s e quences other than their respectiv e compl e mentary 
t e rtiary e xt e nsion products but do not ligat e together du e to a pres e nce of on e or 
mor e mi s matches and individually s e parat e during th e d e naturation tr e atment; and 

d e t e cting th e reporter lab e ls of th e ligation product sequences, 
ther e by indicating the pres e nc e of on e or more low abundanc e target nucl e otid e 
s e qu e nc e s in the sampl e . 

2t A m e thod according to claim 1 , wher e in the oligonucl e otid e 
prob e s in a particular oligonucleotide prob e set hav e a uniqu e l e ngth so that th e 
ligation product s e qu e nces which th e y form can b e distinguish e d from other 
nucl e ic acids: 

s e parating the ligation product s e qu e nces by e l e ctrophoretic 
mobility prior to s aid d e tecting and 

distinguishing, after said detecting, th e ligation product sequences 
which differ in e l e ctrophor e tic mobility. 

A m e thod according to claim 1, wher e in th e s e cond 
oligonucleotide prob e of each oligonucleotide prob e s e t has an addr e ssable array 
sp e cific portion, said m e thod furth e r comprising: 

providing a solid support with differ e nt capture oligonucl e otides 
immobiliz e d at diff e r e nt particular sit e s, wher e in th e captur e oligonucleotid e s 
have nucleotid e s e quences compl e m e ntary to th e addressabl e array sp e cific 
portions and 

contacting th e ligas e d e t e ction r e action mixtur e , aft e r said 
subjecting it to one or more ligas e d e t e ction reaction cycles, with th e solid support 
und e r conditions effective to hybridize the ligation product s e qu e nc e s to th e 
captur e oligonucl e otides in a bas e specific manner, th e reby capturing th e 
addr es sabl e array - sp e cific portions to th e solid support at th e sit e with th e 
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eomplemcntary capture oligonucleotid e , wh e r e in said d e tecting indicat e s th e 
presence of ligation product sequenc e s captur e d using th e addressabl e array 
specific portions and immobilized to the solid support at particular sites, thereby 
indicating th e pres e nc e of on e or mor e targ e t nucleotid e sequ e nces in th e sample. 

4t A method according to claim 1 , wh e r e in th e r e lative 
amounts of on e or more of a low abundanc e s e quenc e , diff e ring by on e or mor e 
single base changes, insertions, d e letions, or translocations and present in a 
sample in unknown amounts with one or mor e high abundanc e sequ e nc e in a 
plurality of target nucl e otid e s, ar e quantified, said method further comprising: 

quantifying, aft e r said subjecting the primary polymerase chain 
reaction mixtur e to on e or more polym e ras e chain r e action cycles, the amounts of 
primary e xt e nsion products; 

providing a known amount of one or mor e marker targ e t nucleotide 

providing one or more sequ e nc e sp e cific prob e sets, including 
prob e sets specifically designed for th e mark e r target nucleotid e sequ e nces; 

blending th e marker targ e t nucleotide sequ e nc e s, and the prob e s e ts 
sp e cifically d e sign e d for the mark e r targ e t nucl e otid e sequ e nces with ligase 
detection reaction mixtur e ; 

quantifying th e amount of ligation product s e quences; and 

comparing th e amount of ligation product s e qu e nc e s g e n e rat e d 
from th e unknown low abundance sample with the amount of ligation product 
s e qu e nces g e n e rat e d from known amounts of mark e r targ e t nucl e otide s e quences 
to provid e a quantitative m e asur e of th e relativ e l e v e l of on e or mor e low 
abundance targ e t nucl e otide s e qu e nc e s in th e sampl e . 

Jh- A method according to claim 4, wh e r e in th e r e lativ e 
amounts of on e or mor e of a low abundance s e qu e nc e , diff e ring by one or mor e 
singl e bas e changes, ins e rtions, d e l e tions, or translocations and pr e sent in a 
sampl e in unknown amounts is pres e nt in less than a 1 : 1,000 molar ratio to th e 
amount of th e high abundanc e s e qu e nc e in a plurality of targ e t nucleotid e s. 
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6r A m e thod according to claim 4, wher e in th e r e lativ e 
amounts of one or mor e of a low abundance sequence, differing by one or mor e 
singl e base chang e s, insertions, del e tions, or translocations and pres e nt in a 
sampl e in unknown amounts is pres e nt in less than a 1 :1 0,000 molar ratio to the 
amount of th e high abundanc e sequ e nc e in the sample. 

7-r A m e thod according to claim 4, wh e rein the relative 
amounts of one or mor e of a low abundance sequence, differing by one or more 
singl e base changes, insertions, deletions, or translocations and pres e nt in a 
sample in unknown amounts is present in l es s than a 1 : 1 00,000 molar ratio to th e 
amount of the high abundance sequenc e in the sampl e . 

8^ A m e thod according to claim 1 , wh e r e in th e efficiency and 
accuracy of conv e rting th e high abundanc e primary polym e rase chain reaction 
product into a s e condary polym e ras e chain reaction product containing a 
r e striction endonucl e ase sit e is improved by p e rforming th e following st e p prior to 
th e providing the secondary oligonucl e otid e primer s e t: 

providing a pre s e condary oligonucl e otid e prim e r set characterized 
by (a) a first oligonucleotid e prim e r, having a targ e t specific portion, and (b) a 
s e cond oligonucl e otid e prim e r, having a targ e t sp e cific portion, wh e r e in the 
target sp e cific portions ar e id e ntical or substantially id e ntical to the s e condary 
oligonucleotid e prim e r s e t but at l e ast one prim e r contain s on e or more nucl e otid e 
analogs, wh e r e in th e oligonucl e otid e prim e rs in a particular set ar e suitabl e for 
hybridization on complem e ntary strands of th e primary e xt e nsion products to 
p e rmit formation of a pr e secondary polym e ras e chain r e action product which 
contains on e or mor e nucl e otid e analogs and opposit e strand bas e chang e s, 
wher e in the pr e secondary oligonucl e otid e prim e r s e t facilitat e s conversion of th e 
primary polymeras e chain reaction product sequ e nc e into a r e striction 
e ndonucl e as e r e cognition sit e in th e subs e qu e nt s e condary polym e ras e chain 
r e action; 

providing a polym e ras e ; 
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bl e nding tho primary extension products, th e pre secondary 
oligonucleotid e primers, and the polym e ra s e to form a pre secondary polym e rase 
chain r e action mixtur e ; 

subj e cting the secondary polymeras e chain r e action mixtur e to two 
or mor e polymeras e chain reaction cycl e s comprising a d e naturation tr e atm e nt, 
wherein hybridized nucl e ic acid s e qu e nc e s ar e separated, a hybridization 
treatment, wherein the s e condary oligonucleotide primers hybridiz e to th e primary 
ext e nsion products, an e xtension treatment, wherein the hybridized secondary 
oligonucleotid e primers ar e e xtended to form pre secondary extension products 
compl e mentary to th e primary extension products, wh e r e in th e pre s e condary 
e xtension products contain one or mor e nucleotide analogu e s and opposit e strand 
bas e changes which facilitate conversion of the primary polym e ras e chain reaction 
product s e quenc e into a r e striction endonucl e ase recognition sit e in th e subs e quent 
s e condary polymeras e chain reaction, wher e in the pr e secondary e xtension 
products ar e then used in place of th e primary e xtension products in th e s e condary 
polymerase chain r e action mixtur e . 

9t A method according to claim 8, wherein th e oligonucleotide 
prob e s in a particular oligonucleotide prob e sot have a unique length so that th e 
ligation product s e qu e nc e s which th e y form can bo distinguished from oth e r 
nucl e ic acids: 

s e parating th e ligation product s e qu e nc e s by e lcctrophorctic 
mobility prior to said d e tecting and 

distinguishing, after said d e t e cting, the ligation product sequ e nc e s 
which diff e r in e l e ctrophor e tic mobility. 

iOr A method according to claim 8, wherein th e s e cond 
oligonucl e otid e prob e of each s e t has an addressabl e array sp e cific portion, said 
m e thod furth e r comprising: 

providing a solid support with differ e nt capture oligonucl e otid e s 
immobiliz e d at diff e r e nt particular sit e s, wh e rein the capture oligonucleotid e s 
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havo nucleotide soquonccs compl e mentary to the addressabl e array specific 
portions and 

contacting th e ligase detection reaction mixtur e , after said 
subjecting it to on e or mor e ligas e detection r e action cycles, with th e solid support 
under conditions effective to hybridize the ligation product sequ e nces to th e 
captur e oligonucl e otides in a bas e specific mann e r, th e r e by capturing th e 
addressable array -s p e cific portion s to th e solid support at th e sit e with th e 
complementary captur e oligonucl e otid e , wh e r e in said det e cting indicates the 
presence of ligation product sequ e nces captured using the addressabl e array 
specific portions and immobilized to the solid support at particular sit e s, thereby 
indicating the presenc e of on e or mor e targ e t nucl e otide sequ e nc e s in the sample. 

A m e thod according to claim 8, wher e in th e relativ e 
amounts of on e or mor e of a low abundanc e sequ e nc e , differing by on e or more 
s ingl e bas e changes, ins e rtions, d e letions, or translocations and present in a 
sampl e in unknown amounts with on e or mor e high abundanc e sequenc e in a 
plurality of targ e t nucl e otid e , ar e quantified, s aid m e thod further comprising: 

quantifying, aft e r said subjecting the primary polym e rase chain 
reaction mixtur e to on e or mor e polym e ras e chain r e action cycles, the amounts of 
primary ext e nsion products; 

providing a known amount of one or more mark e r targ e t nucl e otid e 

sequences; 

providing one or mor e s e qu e nce specific probe sots, including 
prob e sets sp e cifically d e sign e d for th e mark e r targ e t nucl e otid e s e qu e nc e s; 

bl e nding th e mark e r targ e t nucleotid e s e quenc e s, and th e prob e s e ts 
sp e cifically design e d for th e mark e r targ e t nucl e otid e s e qu e nces with ligase 
d e t e ction r e action mixtur e ; 

quantifying the amount of ligation product sequ e nc e s; and 

comparing the amount of ligation product sequenc e s generated 
from th e unknown low abundanc e sampl e with th e amount of ligation product 
s e qu e nc e s gen e rated from known amounts of mark e r targ e t nucl e otid e s e qu e nc e s 
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to provide a quantitative m e asur e of the relative l e v e l of one or more low 
abundanc e targ e t nucl e otide s e qu e nces in tho sample. 

¥2tt A method according to claim 11, wherein the relative 
amounts of one or mor e of a low abundance sequence, differing by on e or more 
single base changes, insertions, deletions, or translocations and pres e nt in a 
s ampl e in unknown amounts is present in a molar ratio of than less than 1 : 1 ,000 to 
the amount of th e high abundanc e sequ e nc e in th e sample. 

Yh-r A m e thod according to claim 1 1 , wherein th e r e lativ e 
amounts of on e or more of a low abundanc e s e qu e nc e , differing by one or mor e 
single base changes, insertions, d e letions, or translocations and present in a 
sample in unknown amounts is present in a molar ratio of less than 1 : 10,000 to th e 
amount of th e high abundanc e sequ e nc e in th e sample. 

\A-r A method according to claim 1 1 , wh e r e in the relativ e 
amounts of one or more of a low abundance s e quenc e , differing by one or mor e 
single bas e changes, insertions, del e tions, or translocations and pres e nt in a 
sample in unknown amounts is pr e s e nt in a ratio molar ratio of less than 1 : 100,000 
to the amount of the high abundanc e s e quenc e in th e sampl e . 

A method according to claim 8, where the nucl e otide analog 
of at least on e oligonucleotid e primer of the pr e s e condary oligonucleotide prim e r 
s e t is at th e 3* e nd of th e prim e r. 

\%-r A m e thod according to claim 8, wh e r e th e nucleotide analog 
is sel e ct e d from tho group consisting of 1 (2' dcoxy 13 D ribofuranosyl)imidazol e 
A carboxamid e , 1 (2' deoxy 13 D ribofuranosyl) 3 nitropyrrol e , T d e oxyinosino, 

6 (2' deoxy J3 D ribofuranosyl) 6H,8H 3,1 dihydropyrimido[4,5 c][l,2]oxazin e 

7 on e , 2 amino 7 (2' d e oxy 13 D ribofuranosyl) 6 m e thoxyaminopurin e , 1 (2' 
dcoxy 13 D ribofuranosyl) A iodopyrazol e , 1 (2 r deoxy 13 D ribofuranosyl)pyrrol e 
3 carboxamid e , and 1 (2 r d e oxy 13 D ribofuranosyl) 1 nitropyrazol e . 
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4-7-r A method according to claim 1 further compri s ing: 
r e p e ating th e e ndonucl e a se dig e stion reaction aft e r said s ubjecting 
th e tertiary polym e ras e chain r e action mixtur e to two or more polym e ras e chain 
reaction cycles and after said subj e cting the ligas e d e t e ction reaction mixture to 
one or more ligas e d e t e ction r e action cycles, wh e r e in, during said repeating th e 
endonuclease digestion reaction, the restriction e ndonucl e ase r e cognizes and 
cl e aves the restriction endonuclease recognition sit e contained within any 
remaining high abundance target, thereby selectively destroying th e high 
abundance tertiary ext e nsion products. 

3-&r A kit for id e ntifying on e or more low abundanc e sequ e nces 
differing by one or more singl e base changes, ins e rtions, or d e l e tions, from a high 
abundance sequence in a plurality of targ e t nucl e otid e sequences comprising: 

providing a primary oligonucl e otide primer s e t characteriz e d by (a) 
a first oligonucleotide prim e r containing a targ e t sp e cific portion, and (b) a second 
oligonucleotide primer containing a targ e t sp e cific portion, wherein the primary 
oligonucleotide primers are suitabl e for hybridization on complem e ntary strands 
of a corresponding high and low abundance targ e t nucleotide s e qu e nces to permit 
formation of a primary e xt e nsion product, but have a mismatch which int e rfer e s 
with formation of such a polym e rase chain r e action product wh e n hybridiz e d to 
any oth e r nucl e otide s e qu e nc e present in th e sampl e ; 

providing a secondary oligonucl e otid e prim e r s e t charact e riz e d by 
(a) a first oligonucl e otid e prim e r, having a targ e t sp e cific portion and a 5 1 
upstream secondary primer sp e cific portion, and (b) a s e cond oligonucl e otid e 
prim e r, having a targ e t specific portion and a 5' upstream secondary prim e r 
s p e cific portion, wherein the s e condary oligonucl e otide prim e rs in a particular set 
ar e suitable for hybridization on complem e ntary strands of the primary extension 
products to p e rmit formation of a s e condary extension product which contains or 
cr e ates a r e striction e ndonucl e ase r e cognition sit e wh e n amplifying the high 
abundanc e targ e t, but do e s not contain or creat e a r e striction endonucleas e 
recognition sit e when amplifying th e on e or mor e low abundanc e targ e ts; 
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providing a tertiary oligonucleotid e prim e r s e t charact e riz e d by (a) 

a first t e rtiary primer containing the sam e sequence as th e 5 ! upstr e am portion of 
J th e first oligonucleotide primer of th e s e condary oligonucleotid e primer s e t, and 

J (b) a second t e rtiary primer containing th e sam e s e qu e nc e as the 5 1 up s tream 

portion of a se cond oligonucl e otid e primer of the secondary oligonucleotide 
| prim e r s e t, wher e in th e set of t e rtiary oligonucleotide primers may be us e d to 

amplify all of th e s e condary ext e nsion products; and 

providing a plurality of oligonucl e otid e prob e sets, each s e t 
| characteriz e d by (a) a first oligonucleotide probo, having a tertiary e xt e nsion 

| product specific portion and a dctcctablo reporter lab e l, and (b) a s e cond 

J oligonucleotide prob e , having a t e rtiary e xt e nsion product specific portion, 

J wherein the oligonucl e otide prob e s in a particular s e t ar e suitabl e for ligation 

| tog e th e r wh e n hybridiz e d adjacent to on e another on a compl e mentary t e rtiary 

e xt e nsion product specific portion, but hav e a mismatch which interferes with 
| such ligation when hybridized to any other nucl e otid e s e quence pres e nt in th e 

sampl e . 

| l-9r A kit according to claim 1 8 furth e r comprising: 

providing a pr e s e condary oligonucleotide primer s e t charact e riz e d 
I by (a) a first oligonucl e otid e prim e r, having a targ e t sp e cific portion, and (b) a 

| second oligonucl e otid e prim e r, having a targ e t sp e cific portion, wher e in the 

I targ e t s p e cific portion s ar e id e ntical or substantially identical to th e s e condary 

I oligonucl e otid e prim e r s e t but at l e ast on e primer contains on e or mor e nucl e otid e 

| analogs, wh e rein th e pr e s e condary oligonucl e otid e prim e r s in a particular s e t ar e 

| suitable for hybridization on complem e ntary strands of th e primary e xt e nsion 

| products to p e rmit formation of a pro secondary ext e nsion product which contains 

| one or more nucleotid e analogs and opposite strand base changes, wher e in the pr e 

I s e condary oligonucleotide primer sot facilitates conversion of the primary 

| e xt e nsion product s e qu e nc e into a restriction e ndonuclease r e cognition site in a 

subs e qu e nt secondary polym e ras e chain reaction. 



20r A kit according to claim 1 8 furth e r comprising: 
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a ligas e . 

24-r A kit according to claim 1 8 further comprising: 
a polym e ras e . 

23rr A kit according to claim 18 further comprising: 
a restriction e ndonucleas e . 

ABSTRACT OF THE DISCLOSURE 

The present invention provides a method for identifying one or 
more low abundance sequences differing by one or more single-base changes, 
insertions, or deletions, from a high abundance sequence in a plurality of target 
nucleotide sequences. The high abundance wild-type sequence is selectively 
removed using high fidelity polymerase chain reaction analog conversion, 
facilitated by optimal buffer conditions, to create a restriction endonuclease site in 
the high abundance wild-type gene, but not in the low abundance mutant gene. 
This allows for digestion of the high abundance DNA. Subsequently the low 
abundant mutant DNA is amplified and detected by the ligase detection reaction 
assay. The present invention also relates to a kit for carrying out this procedure. 



